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MODIFIED SEED STORAGE PROTEINS 
5 This invention relates to plant seed storage 

proteins, and in particular to methods of modifying said 
proteins at specific sites. 

Background of the Invention 

Over the last few years there has been dramatic 

10 progress in the development of gene transfer systems for 
higher plants, and systems are now available for 
monocotyledonous plants, as well as for a variety of 
dicotyledonous plants. It is now possible not only to 
introduce foreign genes (including non-plant genes) into plant 

15 cells and tissues, and to regenerate whole plants, but also to 
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control the site in the plant where such genes will be 
expressed. Methods for introducing genes into plants, and the 
prospects which these methods offer for applying genetic 
engineering methods to improvement of crops, have recently 
5 been reviewed (C.S. Gasser and R.T. Fraley (1989) Science 244 
1293-1299); this review reflects the state of the art. 
International Patent Application No. WO/8903887 discloses a 
process for production of a mammalian peptide via expression 
of a modified seed storage protein gene in a transgenic plant. 

10 The most commonly used vector is Agrobacterium tumef aciens Ti 
plasmid, which is successful in transferring genetic 
information into a variety of dicotyledonous plants, including 
peas and beans. More recently, the plant virus geminivirus 
has been proposed as a general purpose vector for use with 

15 plants (Australian Patent Application No. 599609 (64380/86) by 
Monsanto Company and David Bisaro) 

Genetic engineering of improved grain crops has been 
hampered by the lack of detailed structural information about 
functional storage proteins, which form the major protein 

20 component of plant seeds 1 . After synthesis and processing, 

these proteins are transported to membranous organelles, known 

2-4 

as protein bodies, where they accumulate in large amounts 
Upon germination, they are degraded by endogenous proteases to 
provide nutrition for the growing plant^. Because of the high 

25 level of expression and subsequent accumulation of these 

proteins, they determine the nutritional value of the seed and 
dictate the requirement for supplements when the seed is used 
as foodstuff for man or domestic animals* 

Most of the protein found in mature seeds belongs to 

30 a class called the seed storage proteins. This term is used 
to describe those proteins whose function is to provide, upon 
germination, a source of fixed carbon and nitrogen to sustain 
the early heterotrophic growth of the seedling. However, it 
is because of the importance of seeds to human nourishment 

35 that so much work has been directed at understanding seed 
proteins. Among the best characterised of the seed proteins 
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are those from the agriculturally important legumes, soya bean 
(Gl ycine max ) , garden pea ( Pisum sativum ) and French bean 
(Phaseolus vulgaris ) , although increasing attention is being 
given to food crops of the Third World, including cow pea 
5 ( Vigna unguiculata ) , mung bean ( radiata ) and pigeon pea 
( Cajanus cajan ) . 

As a whole, legume storage proteins, which are 
mostly of the salt-soluble globulin class, are a 
nutritionally-poor source of tryptophan and the sulphur- 

10 containing amino acids, methionine and cysteine, although some 
of the other protein fractions within the seed (for example 
the albumins) may have adequate levels of these essential 
amino acids. Much effort has been directed to improving the 
understanding of storage proteins so as to be able to 

15 manipulate their genes using methods such as recombinant DNA 
techniques, tissue culture and plant breeding in such a way as 
to enhance the nutritional value of seed proteins for human 
and other monogastric animals. 

There are two major types of storage proteins in 

20 legume seeds, known respectively as vicilins and legumins, 

which are distinguishable by their sedimentation coefficients 

(7S/11S), oligomeric organisation (trimeric/hexameric) and 

polypeptide chain structure (single chain/disulphide linked" 

pair of chains) . Both types are found within an 

13 14 

25 individual storage body ' . Analysis of ammo-acid sequence 

data from these two classes suggests that they may be 

evolutionarily and structurally related 15,16 . Clear sequence 

12 

relationships have been established within the classes 
Furthermore, a sequence motif common to legumins and 

30 vicilins 15 has been found to occur twice in vicilins 16 , 
suggesting a repeated structural motif in vicilins. 

Genes encoding two vicilin-type proteins have been 
isolated from pea ( Pisum sativum ) , and transgenically 
expressed in tobacco, and their regulatory DNA sequence 

3 5 elements have been identified; these sequences respectively 
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govern the specificity of expression and level of 
transcription (Newbigin, E.J. and T.J.V. Higgins: Proc. 8th 
Australian Biotechnology Conference, Sydney 1989, 104-109) 
Attempts have been made to improve the functional 
5 properties and nutritional value of glycinin, one of the major 
storage proteins of soybean, by producing cDNAs encoding 
glycinins modified by deletion of subsequences and by addition 
of sequences encoding four continuous methionines. The 
authors have shown expression and accumulation of some of the 

10 mutant proteins in coli cells (Kim et al., 1990). 

However, their result is inconclusive, at best 
preliminary, as no attempt is made at expression in plant 
systems. Furthermore, these authors were limited in that they 
could only make changes to highly variable regions of the 

15 sequence, and then only in an ad hoc fashion. Their methods 
cannot be generally applied across a spectrum of legumins or 
vicilins. Indeed, the regions they have modified are regions 
of low homology to the vicilins, and therefore of no immediate 
application to vicilins. Nevertheless, their work does 

20 underscore that it is realistic to make modified seed storage 
proteins with improved properties according to the present 
invention. 

Another seed storage protein, arcelin, has been 
identified in certain wild forms of Phaseolus vulgaris (Romero 

25 A. et al (1986) Theor. Appl. Genet 72 123-128; Osborn et al. 
(1986) Theor. Appl. Genet. 71 847-855); these proteins are 
toxic to bruchid pests of beans, and transfer of the cloned 
arcelin gene to other bean plant confers insect resistance 
(European Patent Application No. 337750: The Plant Cell 

30 Research Institute Inc) . 

In order to enable us to engineer novel properties 
into seeds without prejudicing their performance in vivo, we 
have for the first time determined the three-dimensional 
structure of phaseolin, the major 7S storage protein from the 

35 French Bean Phaseolus vulgaris . With this structural 

information, we can select suitable sites for modification. 
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Specific protein engineering targets in this system include 

improved nutritional value, altered stability with respect to 

functionality in food systems, and general use as a high-level 

expression system. 

5 Phaseolin, a vicilin-like molecule, is a trimer of 

three similar glycosylated polypeptides of molecular weights 

around 50 000 17 . Unlike other vicilins, the phaseolin trimer 

18 

(M r about 150 000) associates into a dodecamer below pH 4.5. 

Because of the sequence and structural relationships 

10 between vicilins and legumins referred to above and further 

discussed hereinafter, we consider that phaseolin is a 

suitable model for legume 7S and lis storage proteins in 

particular, and seed storage 7S and 11S proteins in general. 

45 

Phaseolin has been expressed in transgenic plants 

15 Madison et al. have recently used the 

three-dimensional structure of the complex between trypsin and 
bovine pancreatic tissue inhibitor to predict the site of 
interaction between tissue plasminogen activator (tPA) and 
plasminogen activator inhibitor-1 (PAI-1). Although the 

20 three-dimensional structures of tPA and PAI-1 are not known, 
tPA and trypsin have sequence similarities. Using these 
similarities, the authors identified a loop, and one 
particular residue within the loop, which forms part of the 
interactive surface between PAI-1 and tPA; by changing the 

25 sequence of this loop by site-directed mutagenesis, they 
produced tPA which was less susceptible to inhibition (E.L. 
Madison, E.J. Goldsmith, R.D. Gerard, M-J.H. Gething, and J.F. 
Sambrook (1989) Nature 339 721-724). 

This strongly suggests that the three-dimensional 

30 structure of phaseolin can be used to predict the 

three-dimensional structures of other vicilins and of 
legumins . 

We have found that the polypeptides of the trimeric 
seed storage protein phaseolin comprise two 
3 5 structurally- similar units each made up of a (h -barrel and an 
K -helical domain. The ^-barrel has the "jelly-roll" folding 
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topology shown by viral coat proteins and the * -helical 
domain shows structural similarity to the helix-turn-helix 
motif found in certain DNA-binding proteins. The tetramer of 
trimers referred to above turns out to be the form of the 
5 molecule in the crystals studied here. 

Current experience suggests that protein structure 
is remarkably stable to manipulation by site-directed 
mutagenesis. Moreover, functional properties of oc^s barrel 
proteins, 8-stranded f) -barrel enzymes, and four-helix 
10 frameworks are stable to insertions and other modifications 
(reviewed by D. Ringe (1989) Nature 339. 658-9). 

Summary of the Invention 

The major barrier to modifying plants to produce 
seed storage proteins with commercially desirable properties 
15 is knowledge of the three-dimensional structure of the storage 
proteins . 

We have now determined the three-dimensional 
structure of a representative seed storage protein, phaseolin. 
Current knowledge in the construction of transgenic plants, 
20 and in protein structure and engineering, enables us to use 
this three-dimensional structure to select target sites in the 
structure for one or more of the following modifications: 

(a) introduction of point mutations; 

(b) deleting or inserting sequences; 
25 (c) adding glycosylation sites; 

(d) introducing disulphide bonds. 
Thus, based on the three-dimensional structure 
disclosed herein, and experimental protocols known to persons 
skilled in the art, the following modifications can be made: 
30 (i) Introduction of new N-linked glycosylation 

sites for altering the stability and/or 
solubility of the protein; 
(ii) Substitution of internal amino acids by 

methionine to raise the sulphur content of the 
35 seed; 
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(iii) Truncation of protease-susceptible loops on 
the structure to improve stability; 

(iv) Introduction of protease-labile loops to 
improve the digestive properties; 

5 (v) Introduction of disulphide bonds or other 

stabilising mutations to protect the storage 
protein structure on passage through the rumen 
to improve the nutritional value of the 
protein for ruminants; 
10 (vi) Introduction of heterologous protein or 

peptide sequences into the structure to enable 
their production in crops; and 
(vii) Modifications to the sequence to enhance 

resistance of the seeds or products thereof to 
15 pests such as insects. 

Aspects of these properties tnat are relevant in 
food systems include: 

(a) stability and pH solubility for the large 

scale extraction of the proteins from seeds; 
20 (b) acid solubility for protein enriched 

beverages; 

(c) thermostability for heat-setting properties in 
snack foods; and 

(d) amino acid composition for nutritional 
25 balance. 

In one aspect, the present invention provides a 
mutein which is a variant of a naturally-occurring legume seed 
storage protein, wherein said mutein has a modified primary 
structure relative to said legume storage protein, but retains 
3 0 the tertiary and quaternary structure of said legume storage 
protein. 

For the purposes of this specification, retention of 
the tertiary and quaternary structure is to be understood to 
mean that elements of that tertiary and quaternary structure 
35 which are not the subject of primary structure modifications 
are substantially unaffected by said modifications. 
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Thus according to one preferred embodiment of the 
present invention there is provided a plant 7S or IIS storage 
protein modified at a specific amino acid residue or a 
specific region of its amino acid sequence, wherein the 
5 tertiary and quarternary structure of the naturally occurring 
storage protein is retained. 

Preferably the modification is selected from the 
group consisting of: 

(a) introduction of one or more point mutations; 
10 (b) deletion of one or more defined sequences of 

amino acids; 

(c) insertion of one or more defined sequences of 
amino acids; 

(d) introduction of one or more glycosylation 
15 sites; and 

(e) introduction of one or more disulphide bonds. 
More preferably the modification is selected from 

the group consisting of: 

(a) introduction of N-linked glycosylation sites; 
20 (b) substitution of internal amino acids by 

methionine; 

(c) truncation of protease-labile loops; and 

(d) introduction of protease-labile loops. 
According to a second aspect of the invention, there 

25 is provided a DNA molecule whose sequence encodes a mutein, as 
defined above. Preferably this DNA encodes a protein having 
the properties of a plant 7S or lis storage protein, and also 
encodes one or more of the modifications set out above. 
Plasmids, expression vectors, and microorganisms comprising 

30 said DNA are also within the scope of the invention. 

According to a third aspect of the invention there 
is provided a transgenic plant or part thereof having a DNA 
sequence as defined above. 

Preferably the plant part is a seed. 
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It will be evident to the person skilled in the art 
that the changes to the seed storage proteins provided by the 
method according to the invention will have to be made in the 
first instance at the DNA level. The modified DNA thus 
5 represents the initial embodiment of the changes; such DNA 
will be converted, via the processes of transcription and 
translation in the cell/ to yield the modified seed storage 
protein. 

Brief Description of the Drawings 

10 Figure 1 (25 sheets) represents the atomic 

co-ordinates of phaseolin in orthogonal X units. The 
three-fold symmetry axis of the phaseolin trimer is coincident 
with the Y axis of the co-ordinate frame. The coordinates of 
the C atoms have been deposited by the applicant in the 

15 Brookhaven Protein Data Bank. 

Figure 2 represents schematic diagrams showing the 
two observed patterns of interaction between the 
structural units. The absence of residues 213-219 in the 
electron density map leaves an ambiguity as to which is the 

20 correct pairing of the X+ units to form the polypeptide, 
(a) represents the preferred polypeptide (see text). In both 
diagrams the view is down the molecular three-fold axis and 
from the centre of the tetramer outwards. The pseudo-diads 
relating the structural units lie in the plane of the paper, 

25 intersecting the molecular three-fold. 

Figure 3 represents a stereo pair showing a CK 
trace of the phaseolin trimer, viewed down the molecular 
three-fold towards the centre of the tetramer. The N- and 
C-termini of each subunit are labelled. The two possible ways 

30 of linking the two &+/> units to form the subunit are 

indicated: the shorter link of 17& (solid line) is favoured 
above the longer 34A link (dashed line) . The figure was drawn 
using a computer program written by Lesk and Hardman^. 
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Figure 4 is a diagram showing the structure of 
native phaseolin (A) and the proposed high-methionine mutant 
phaseolin (B) in the neighbourhood of residues 84 to 88. The 
methionine sidechains of the mutant are seen to be 
5 accommodated in the structure without disruption to the 

tertiary or quaternary structure of the protein. For clarity, 
residues in the same /* -sheet as residues 84 to 88 are not 
shown . 

Figure 5 is a diagram showing the structure of 
10 native phaseolin (A) and the proposed high-methionine mutant 
phaseolin (B) in the neighbourhood of residues 261-265. The 
methionine sidechains of the mutant are seen to be 
accommodated in the structure without disruption to the 
tertiary and quaternary structure of the protein. For 
15 clarity, residues in the same /i -sheet as residues 261-265 are 
not shown. 

Structure Description 

X-ray diffraction data to 3& resolution were 

19 

collected from Type II crystals of phaseolin and heavy atom 
20 derivatives. The methods of analysis do not form part of this 
invention. However, various attempts to solve the structure 
by multiple isomorphous replacement and non-crystallographic 
symmetry averaging were unsuccessful over a period of 6 years, 
until a cautious approach using phase extension procedures was 
2 5 adopted. 

Structural heterogeneity 

Three different genes encoding potential protein 
products have been identified by gene sequencing 11 . These 
differ only in number of amino acids, and contain 397, 411 and 
30 412 amino acids respectively, after allowing for cleavage of a 
24 amino-acid signal sequence- The sequence and numbering 
used here is given in Table 1. 
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. . . 0000000 . 00000000 . 1 . /J0/3/300 00000000 V000 00000 
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0 00000000 0100 
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B 
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H 
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& PPPPP- . .P PG0 PPPPP PPPPP* I aaaacraa. . aacraaaacraa . . 
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• aaaaaaaaaa [ ] 



Table 1. Amino-acid sequence of the shorter (397 residue) 0-type 
phaseolin polypeptide. The line underneath the sequence contains the 
following symbols, a: regions of a-helical secondary structure, p: 
regions of £-sheet secondary structure, G: glycosylation sites, [ ]: 
regions of unobserved electron density, I: intron locations, V: 
points of major sequence insertion in other vicilin proteins. Shown 
above the sequence are the labels used to denote the 
secondary-structural elements in the text. Note that the secondary 
structure assignments are based on the current unrefined model and 
thus that the precise start and end points of the strands and helices 
may be in error by one or two amino acids in some places. 
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Compared with the 397 residue protein, the 411 
residue protein has a five amino-acid insertion after residue 
189 and a nine amino-acid insertion after residue 390. The 
412 residue protein has a further insertion of one amino acid 
5 after residue 100. The chain trace of the electron density 
map (not shown) is that of the 397 amino-acid protein (termed 
the f!> -type polypeptide 11 ) and shows no break or weakening of 
electron density in the vicinity of residue 189. We thus 
conclude that the 397 amino-acid protein is overwhelmingly 
10 dominant in the crystal. It will be clear to the person 
skilled in the art that these three genes can be readily 
interconverted, using presently available techniques. This 
invention is to be understood to apply to all three forms of 
the protein. 

15 The map shows weak evidence of glycan binding at 

24 

both known Asn-X-Ser/Thr glycosylation triplets . The 
density is not yet sufficiently clear to provide information 
about the nature of heterogeneity of the glycans. 

Folding topology 

20 Figure 2 shows a schematic drawing of the 

polypeptide. It consists of two structurally-similar units of 
about 160 amino acids each. These are related by a 
pseudo-diad axis nearly perpendicular to and intersecting the 
molecular triad axis. Each unit is itself an K + /S two-domain 

25 structure, the first of some 110 residues being a classic 
viral capsid jelly- roll structure, and the second smaller 
domain being a cluster of three helices, including a 
helix-turn-helix motif. A fourth helix in the N-terminal 
unit, which is not structurally 
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associated with the three-helix cluster, forms part of the 
connection through to the C-terminal unit. 

The jelly-roll barrel structure in phaseolin is stripped of 
nearly all of the elaboration seen in the viral caps id 
proteins 25 ' 26 ; in this respect it is similar to the barrel 
structure observed in catabolite gene activator protein 
(CAP) 27 . It is convenient to denote the eight strands of the 
jelly-roll ^-barrel as B through I, in analogy with the 
notation used in the discussion of the viral capsid proteins 25 . 
In only two places in phaseolin is there found a loop of 
structure external to the barrel, between strands F and G of 
the N-terminal barrel (residues ca 102-108) and between strands 
E and F of the C-terminal barrel (residues ca 277-286) . Each 
barrel has an A strand reminiscent of the C subunits of the 
plant virus capsid proteins 25 , and another 0-strand, 
antiparallel and N-terminal to A, which we label here strand 
A*. Both barrels have a further strand (which we label J) 
C-terminal to the barrel, making a total of eleven strands in 
the 0-sheet structure. Strands C of each barrel have a bulge at 
positions 65-68 and 252-255 respectively. 

The helical domains comprise residues ca 156-181 and 340-371, 
each being a three-helix cluster. These helices bear a 
striking similarity to a helix-turn-helix motif in Cro, a DNA 
binding protein 28 . This will be discussed further below. The 
occurrence on a single polypeptide of a jelly-roll /9-barrel and 
a helical cluster reminiscent of DNA binding proteins has 
previously been observed for the CAP protein. However, the 
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relative position- of these two domains in phaseolin and CAP is 
very different. 

The internal sequence repeat 16 observed in phaseolin, jack bean 
canavalin and pea vicilin is the basis for the structural 
repeat described here. The sequence similarity in the internal 
repeat is low («15* identity) , but the amino-acid alignment 
corresponds to the structural repeat perfectly in many places 
and to within a few residues in the worst cases... The domains 
are remarkably similar; a least-squares fit of the Ca positions 
in corresponding structural elements of the pair of units 
yields an r.m.s. deviation of only 2.2k. 

Exon/intron boundaries are the same in phaseolin and 
conglycinin DNA sequences 29 ' 7 ' 12 . Those five boundaries are, 
in the first unit, at the DE corner (residue 81) , within the J 
0-strand (144) , the H2-H3 comer (171) ; and in the second 
unit, the BC corner (248) and the p->a connection (335) . if 
strand J is considered as a linking element between the 0 and a 
structures, then all these boundaries lie at the 
interconnection of various elements of the secondary structure. 

Trimer structure 



The six structural a+0 units which comprise the phaseolin 
trimer are arranged alternately up and down around the 
three-fold axis and are nearly coplanar. The trimer is 
therefore a disc of diameter 90A and thickness 35A, as 
predicted by electron microscopy 30 and exhibits approximate 32 
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point group symmetry. Such an arrangement suggests two types of 
•interfaces 1 between the units, intrachain contacts within one 
polypeptide , and trimer contacts between the three chains. Each 
interface is centred around a pseudo-diad relating the two 
structurally-similar units of the polypeptide. No electron 
density is visible for the region linking the units ( residues 
213-219) leaving an apparent ambiguity as to which adjacent 
pair of units in the trimer is on the one polypeptide chain 
(see Figure 3). One pair has a distance of 37A between the Ca 
positions of residues 212 and 220, and the other has a 
corresponding distance of 18 A. To avoid postulating an 
extremely extended polypeptide linker, we conclude tentatively 
that the subunit is most likely formed from the latter pair. 
(The question of definition of the trimer within the tetramer 
will be discussed below; for completeness it should be pointed 
out that it is not possible for the missing polypeptide link to 
run from an N-terminal unit in one trimer to the C-terminal 
unit in another as these are separated by » 60A) . 

The jelly-roll 0-barrel has a remarkable capacity to exist in 
different states of oligomerization 32 ' 33 . The hexameric pattern 
observed here with quasi -3 2 point group symmetry is unlike the 
packing around the quasi-6-fold axis of the T=3 icosahedral 
surface lattice 26 . Furthermore the specific interactions 
between the barrels seen here are unlike those observed in T=l 
capsids 25 , tumour necrosis factor 34 or in the adenovirus 
hexon 33 . 

The B-I-D-G face of the two barrels associate with each around 
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aligned packing of ^-sheets. The two potential N-linked 
glycosylation sites are both within the c-terminal barrel on 
strands A and I respectively. The sugar attached to strand I 
could contact the neighbouring N-terminal barrel. 

The helical domains of the subunits are closely associated with 
each other around the pseudo-diad relating the neighbouring 
subunits (see Figure 2b) . As mentioned above, t* e 
helix-turn-helix motifs formed by helices 2 and 3 of each 
domain are similar to those found in certain DNA-binding 
proteins**, typified by the Cro protein, a least-squares fit 
of the ca positions of residues 163-183 and 351-371 of 
phaseolin to the helix-tum-helix motif formed by residues 
16-36 of cro yields an r.m.s. deviation of 1.9A and 1.3A 
respectively, within the spread determined for these motifs^. 
The capacity of helix-turn-helix structures to bind DNA is 
associated with their condensation into dimers with diad 
symmetry coincident with that of double-stranded DNA, in 
particular with elements of the diad-related helices positioned 
34A apart and binding to consecutive major grooves of DNA. No 
such 34A period is evident in the dimerisation of the phaseolin 
helical domains, nor indeed are the two domains identical. The 
dimensions of that domain pair are more of the. order of ISA (in 
the trimer, not the polypeptide as we have chosen to define it 
here, Figure 3) , sufficient only to span adjacent major and 
minor grooves of DNA. All of the DNA-protein structures solved 
to date which have helix-turn-helix motifs for the DNA binding 
site exemplify the diad-symmetrical form of binding. 
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Recent studies 35 ' 36 Q f structures in the protein data bank 37 
have demonstrated that DNA binding protein II 38 , the ribosomal 
L7/L12 protein 39 and cytochrome c peroxidase 40 all contain 
helix-tum-helix motifs. The first two of these bind nucleic 
acids, whether or not via the helix-turn-helix motif is 
unknown. However the cytochrome C peroxidase structure embodies 
the motif semi-internally, making interaction with DNA very 
unlikely 35 . 

In phaseolin the second helix in the motif (i.e. the third in 
the helical domain) is exposed to the environment in both 
structural units on the polypeptide and within the trimer 
(Figure 3) . We know of no evidence suggesting that phaseolin is 
able to bind DMA, specifically or non-specifically. The 
translocation of the phaseolin polypeptide from its site of 
synthesis on membrane-bound polysomes through the endoplasmic 
reticulum into storage bodies 41 ' 3 makes such an interaction 
unlikely. Nevertheless, DNA-binding studies could provide 
direct evidence for or against a role for storage proteins in 
the regulation of their own high level of synthesis. 

Ligand binding sites 

We have previously reported 19 the results of a phosphorus 
analysis of type II phaseolin crystals which implies that two 
molecules of phytic acid might be associated with each SOkDa 
polypeptide. There is no electron-dense feature in the current 
3A map consistent with such binding, though phytate binding to 
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the flexible ends, or linker region of the polypeptide cannot be 
excluded . 

At the interface between the two jelly-roll domains of the 
subunit involving the sheets B-I-D-G referred to above, there 
is an intense spherical electron-dense feature lying in a 
pocket formed by the side chains of residues His 32, Glu 56, 
Arg 58, Arg 79, Arg 130, Lys 296 and Asp 297, A possible 
explanation is that it corresponds to phosphate bound either in 
vivo or during the purification process. A second unexplained, 
though more diffuse, electron-dense feature occurs within the 
N-terminal ^-barrel and is associated with the side chains of 
residues Gin 133 and Glu 145. 



The tetramer of triaers 



The crystallization conditions used here for phaseolin have 
encouraged the formation of tetrameric 18S particles. The 
association of trimers into dodecamers under conditions of acid 
pH is well-characterized 18 , and 18S particles so produced have 
been imaged by electron microscopy 30 . The 18S particle 
observed in the crystal, being formed by four trimers assembled 
on the faces of a regular tetrahedron, is consistent with the 
negatively stained e.m. image as judged by the positioning of 
the solvent channels. 

There are two ways of grouping the twelve subunit polypeptide 
chains in the tetrahedron into trimers. Inspection of the 
particle shows that in the one association, the intra-triraer 
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polypeptide contacts are extensive (see Figure 3) whereas in 
the alternate association the subunits are tenuously packed 
(reminiscent of protein crystal packing) . We conclude that the 
former arrangement is the 7S (ISOkDa) particle occurring 
in vivo and observed as a featureless disc by electron 
microscopy 30 , and have used this definition of the trimer in 
the discussion of the preceding sections. The inter-trimer 
contact in the tetramer occurs at the tetrahedral diad axes. It 
consists of a symmetry-related pair of interactions, each 
involving the N-terminal 0-barrel of one subunit with the 
C-terminal ^-barrel of a subunit from the neighbouring trimer. 
The N-terminal strands of polypeptides from neighbouring 
trimers aire also in contact at the tetrahedral diad. 

In situ organization 

Although no crystallinity of Phased us vulgaris seed sections 
is observable by X-ray dif fraction 19 , we cannot rule out a 
possible link between the curious packing of phaseolin in 
pseudo-cubic high-salt crystals 19 ' 42 with structures in situ in 
the seed-storage protein body. (Such a link has been 
demonstrated for the IIS protein cucurbitin 43 } . The high- 
saltcrystals, with pseudo -cubic symmetry P432 and cell 
dimension a » 66k, can contain only one phaseolin trimer per 
unit cell. What combination of crystal twinning and/or disorder 
generates the pseudo-cubic point group is unclear. One 
possibility emerges from the dodecamer structure observed here. 
The centre-to-centre distance between the trimers in the 
tetrahedral 18S particle is » 65k. Packing errors, whereby 
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trimers are incorporated into the tetrahedral structures 
without proper cognizance of the top and bottom face of the 
trimer, could lead to non-closed forms of the tetramer (of 
trimers) and an average structure in which the 65A period was 
associated with a cubic point group. Other explanations 
include the possibility that the pseudo 32 point group of the 
trimer is not maintained in high salt, but is distorted into a 
quasi-cubic structure • 

Related Structures 

Homology with other vicilins 

Sequence alignments imply structural similarity of 7S proteins 
from common bean, jack bean, soybean and pea seeds 12 ' 15 ' 16 ■ 
Insertions and deletions in these alignments are, with minor 
modifications, compatible with the phaseolin structure (see 
Table 1) . For the most part, the phaseolin sequence is shorter 
than the other sequences, the exception being a five residue 
insertion around position 350, which may affect the structure 
of the connection between helices l and 2 of the C- terminal 
structural unit. Significant insertions in the other sequences 
are found near the N-tenainus and at the FG corner (residue 
107, Table l) of the first barrel domain (or f -subunit of 
0-conglycinin of soybean) , in the linker between helices 3 and 
4 (residue 188) of the first helical domain (pea vicilin) , in 
the linker (residue 218) between the two a+0 units (all 
sequences except phaseolin), and at the EF corner (residue 283) 
of the second barrel domain (larger inserts for pea proteins 
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than for soybean) • ah of these regions are on the surface of 
the trimer, (The insertion site at residue 218 15 has been 
placed alternatively at residue 247 12 ; we consider that the 
former leads to the more natural sequence alignment) • 

A preliminary structure for jack bean canavalin (A.McPherson, 
personal communication) shows pseudo 32 point group symmetry 
and a viral capsid domain in each of the two structural units 
per polypeptide. 

Homology with legumins 

Based on amino-acid sequence similarity, the basic polypeptide 
chain of legumins is believed to contain one copy of the 
repeating structural unit of a vicilin molecule 16 , i.e. the 
a+0 structure described here. Weak similarity between parts of 
the acidic chain and the 7S proteins has also been reported, 
but is less convincing 15 . X-ray diffraction experiments 43 have 
indicated that the legumin protein cucurbitin has point group 
symmetry 23, at least at low resolution (ca 20A) . This 
observation requires that the polypeptide displays an internal 
repeating structure, which in the US particle is a diad, 
albeit not perpendicular to any three-fold axis. The assembly 
of the six polypeptides of legumin into the accumulated 
oligomeric form is believed to proceed via a IS (trimer) 
intermediate, the final coalescing of trimers not occurring 
until cleavage of the polypeptides into the basic and acidic 
chains 44 . If the trimer form is indeed similar to the vicilin 
7S molecules, with pseudo point group 32, then a rearrangement 
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of the subunits within the trimer must accompany the formation 
of the lis particle, in order to satisfy the resulting cubic 
arrangement of diads and triads. 

Novel phaseolins 
5 One attempt has previously been made to increase the 

sulphur content of phaseolin by the insertion of a 45 
base-pair synthetic duplex, rich in raethionine-codons, at the 
Xba I site in the third exon of the /L -phaseolin gene. 
Transgenic tobacco plants containing normal or modified 

10 phaseolin genes were then monitored for the production and 
deposition of phaseolin. Whereas in both cases phaseolin 
expression was achieved, deposition of the modified protein in 
the storage bodies did not occur, suggesting prior 
degradation 45 . Using the three-dimensional structure, we are 

15 now able to identify the Xba I site (after residue 165, Table 
1) as being internal to helix 2 (i.e. the first helix in the 
helix-turn-helix motif) and hence part of a major structural 
element of the phaseolin trimer. We therefore predict that an 
inclusion of fifteen residues at this site could distort the 

20 structure at the tertiary and/or quaternary level, making the 
modified protein more susceptible to degradation. 

Conclusions 

We have now confirmed the internal repeat observed 
in the sequence of phaseolin at a structural level; this 

2 5 repeat manifests itself as a pair of « + units forming the 
50kDa subunit. The high sequence similarity to other 7S 
storage proteins suggests a common structure; indeed a 
preliminary X-ray structure for canavalin, the 7S protein of 
the jack bean, indicates a hexameric arrangement of ^-barrels 

30 as described here (A. McPherson, personal communication) . 

The structure presented here identifies surface 
regions of the molecule, which represent candidate sites for 
genetically-engineered insertions into the sequence. 
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The invention will now be illustrated by way of 
reference only to the following non-limiting examples. The 
references to amino acid sequence numbering are as in Table 1. 

In these examples, site directed mutagenesis is used 
5 to introduce desired mutations at defined sites in known 
fragments of wild type phaseolin cDNA. 

Mutations are introduced by known methods, such as 
those described by Kunkel (46), into restriction fragments 
which include the segment of DNA encoding the locus for the 

10 desired mutation. The resulting DNA is used to transfect a 
host, such as E.coli , and single stranded DNA prepared from 
plaques. The presence of the desired mutation is confirmed by 
complete sequencing of the same restriction fragment, for 
example using the method of Sanger et al. (47). The double 

15 stranded replicative form of DNAs of proven mutants is 

isolated, and the mutant restriction fragments isolated by 
agarose electrophoresis (48). These fragments are then used 
to replace the corresponding fragment of wild type DNA in an 
expression vector. All the recombinant methods used are as 

20 described by Sambrook et al. (48). Other suitable methods for 
these procedures would readily occur to the person skilled in 
the art. 

Example 1 - Engineering of High Methionine Phaseolin Mutations 
The two sequences selected as targets for 

25 engineering of high -methionine phaseolins are "ILVLV" in 
region E between amino acids 61 and 120, Table 1 and "IVTLV" 
in region D between amino acids 241 and 300, Table 1, which 
both correspond to hydrophobic domains, ie. regions of 
internal y$ structure. Each of these sequences is mutated to 

30 code for methionine pockets (MMMMM) by iterated cycles of 
oligonucleotide site-directed mutagenesis. An intron-less 
yVphaseolin minigene is mutated; this minigene, designated 
£wt*~ , has recently been shown to direct efficient expression 
of phaseolin protein in transgenic tobacco seeds (Bustos et 

35 al., 1990; (49)). The advantages of using the minigene 
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instead of phaseolin genomic DNA fragments derive from the 
absence of intervening sequences. This facilitates the design 
of gene constructions and permutations of DNA fragments to 
combine different mutations. Due to the large number of 
5 individual nucleotides that need to be mutated over a span of 
15 or 18 nucelotides, a strategy involving multiple rounds of 
mutagenesis is preferable to one that would attempt to produce 
all changes at once. This has the added advantage of 
generating intermediates with fewer than five new methionines , 
10 which may be useful when assessing the effects of individual 
amino acid changes on the structure and in vivo stability of 
the protein. The detailed mutagenesis strategies are as 
follows : 



"ILVLV" site: 

15 A 381 bp EcoRI-PstI restriction fragment from clone 

£wt 1- (Bustos et al., 1990) is subcloned into the phagemid 
vector pBSKS+ (Stratagene) and the resulting clone is used to 
produce a U-containing single stranded DNA template. Three 
different synthetic single-stranded oligonucleotide DNA 

20 molecules (shown below as oligos I to III) are utilized to 
mutagenize the wt sequence by the method of Kunkel (46). The 
individual base substitutions are indicated in boldface. 
Screening of mutated clones is performed directly by 
sequencing with an oligonucleotide primer designed to 

25 hybridize 35 nucleotides upstream of the target sequences. 

81 91 
gly ser ala lie Leu Val Leu Val lys pro asp 

wt . • . ggg age gec ATA CTC GTC TTG GTG aaa cct gat • • • 

5 • 3 1 

oligo I age gec ATG ATG GTC TTG 

oligo II 5, G GTC TTG ATG aaa cct 3 ' 

30 oligo III 5f gcc ATG ATG ATG ATG ATG aaa cct 3 ' 

final 81 gly ser ala Met Met Met Met Met lys pro asp 91 

• . . ggg age gec ATG ATG ATG ATG ATG aaa cct gat . . . 
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After each round of mutagenesis has been completed, 
the new EcoRI-PstI DNA fragments are replaced into the /5wt x ~ 
gene to yield modified phseolin genes coding for two, three 
and five new methionine residues. 

5 "IVILV" site" 

A 2.0 kbp Xbal-BamHI restriction fragment from clone 
ySwt 1 " (Bustos et al., 1990) is subcloned into the phagemid 
vector pBSKS+ (Stratagene) and the resulting clone used to 
produce a U-containing single-stranded DNA template. In a 

10 manner analogous to that employed at the ILVLV site, three 
synthetic oligonucleotides (oligos IV to VI, shown below) are 
used to rautagenize the wt sequence using the same method. As 
for the other site, each oligo is designed to mutate two or 
three closely spaced nucleotides, and all oligos result in at 

15 least one new methionine codon. 

258 ser lys ala He Val He Leu Val val asn glu gly 269 
wt ... tct aag gcc ATT GTT ATA CTA GTG gtt aat gaa gga . . 

oligo IV 5 'aag gcc ATG ATG ATA CTA GT 3 ' 

oligo V 5 'tG ATA CTG ATG gtt aat 3 ' 

20 oligo VI 5 'tG ATG ATG ATG ATG gtt 31 

final 258 ser lys ala Met Met Met Met Met val asn glu gly 269 

. . • tct aag gcc ATG ATG ATG ATG ATG gtt aat gaa gga . . 

As for the ILVLV site, the sequences at and around 
the mutated sites are verified using a sequencing primer that 
25 hybridizes near the site. After mutagenesis the Xbal-BamHI 
fragments are replaced into /$wt 1- " or into the high-methionine 
mutants modified at the ILVLV site. A maximum of ten new 
methionine residues result from the combination of mutations 
at both sites. 

30 All mutated phaseolin genes are subsequently 

subcloned into the binary vector pBIN19 and transferred into 
the genome of tobacco by A^ tumef aciens -mediated 
transformation . 
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Example 2 - Model Building of High-Methionine Mutant Phaseolin 

The structure of the high-methionine mutant 
phaseolin can be investigated by model-building as follows. 
An interactive molecular-graphics computer program (QUANTA, 
5 Polygen Corporation) was used to model the replacement of 
residues 84 to 88 and 261 to 265 by methionine. The energy- 
refinement computer-program XPLOR (Harvard University) was 
then used to refine the atomic coordinates of the mutant 
structure to ensure that favourable stereochemistry is 

10 achieved, keeping the remainder of the protein as close as 
possible to the native structure. The results of the 
modelling are shown in Figures 4 and 5; there are no 
significant difference in energy between the native and mutant 
structures (native: -11529 Kcal/mol, mutant: -11677 Kcal/mol) 

15 and no disruption to the tertiary or quaternary structure is 
predicted. 

Example 3 

The environment of the following residues in the 
phaseolin structure indicates that they can be altered to 
2 0 Methionine without prejudice to the three-dimensional 
structure of the protein: 

Leu 65, Leu 76, Phe 252, Leu 264, He 177, Leu 181. 

Example 4 

Mutation of Isoleucine 105 to Asparagine introduces 
25 an N-linked glycosylation site at that position. Sugar 
attached at that site may be accommodated on the surface of 
the phaseolin trimer. 

Example 5 

Elimination of the loop of structure between beta 
30 strands E and P of the C- terminal structural unit may be 
accomplished by deleting residues numbered 278 to 287 and 
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replacing them with a linker of two glycine residues* This 
increases the stability of phaseolin to enzymes with 
trypsin-like specificity. 

In general, the selection of sites for mutagenesis 
5 may be influenced by the accessibility of the target site to 
restriction endonucleases. The criteria for assessing whether 
the desired mutation has been achieved will depend on the 
nature of the mutation attempted, but include? 

a) sequencing of the DNA in the construct; 
10 b) analysis of the product protein for presence or 

absence of the desired property, eg. by amino 
acid analysis or susceptibility to enzymes; 

c) retention of tertiary or quaternary structure; 

d) cross-reaction with antibody directed against 
15 the native protein; and 

e) successful expression in vivo . 

For example, if the mutation is to substitute 
methionine for another amino acid, the presence of one or two 
met residues could be detected by cyanogen bromide cleavage; 

20 the presence of about 10 met residues could be readily assayed 
by amino acid analysis. Retention of three dimensional 
structure (tertiary or quaternary) can be assessed in advance 
by model building, and tested in the protein actually produced 
by success in expression in vivo , and by various physical 

25 analyses such as molecular weight, solubility/ and circular 
dichroism spectral studies. Other suitable methods will be 
known to those skilled in the art. 

Although the invention has been illustrated with 
reference to phaseolin, sequence homologies indicate that 
30 similar results may be expected with related proteins, such as 
the homologous protein from pea, and to some extent with 
legumins as implied by the homology described in reference 16. 

References cited herein are identified in full on 
the following pages. 
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It will be clearly understood that the invention in 
its general aspects is not limited to the specific details 
referred to hereinabove. 
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CLAIMS : 

A mutein which is a variant of a naturally-occurring 
legume seed storage protein, wherein said mutein has a 
modified primary structure relative to said legume storage 
protein, but retains the tertiary and quaternary structure of 
said legume storage protein. 

2 - A plant 7S or US storage protein modified at a 

specific amino acid residue or a specific region of its amino 
acid sequence, wherein the tertiary and quaternary structure 
of the naturally occurring storage protein is retained. 
3. A storage protein according to Claim 1 or Claim 2, 

wherein the modification is selected from the group consisting 

(a) introduction of one or more point mutations; 

(b) deletion of one or more defined sequences of 
amino acids; 

(c) insertion of one or more defined sequences of 
amino acids; 

(d) 

sites; and 

(e) introduction of one or more disulphide bonds. 

4 - A plant storage protein according to Claim 3, 
wherein the modification is selected from the group consisting 

(a) introduction of N-linked glycosylation sites; 

(b) substitution of internal amino acids by 
methionine; 

(c) truncation of protease-labile loops; and 

(d) introduction of protease-labile loops. 

5- A protein according to any one of Claims 1 to 5 
which is a vicilin or a legumin. 

6. A protein according to Claim 5 which is a phaseolin. 

7. A DNA molecule whose sequence encodes a storage 
protein as defined in any one of Claims 1 to 6. 
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8. A DNA molecule according to Claim 7, whose sequence 
encodes a protein having the properties of a plant 7S or IIS 
storage protein , and also encodes one or more of the . 
modifications set out above. 

9. A DNA molecule according to Claim 8 wherein the 
protein is a vicilin or a legumin. 

10. A DNA molecule according to Claim 9 wherein the 
protein is phaseolin. 

11. An autonomous unit of DNA replication in vivo or in 
vitro selected from the group consisting of plasmids, cosmids, 
expression vectors, microorganisms, viruses and chromosomes, 
said autonomous unit comprising a DNA molecule according to 
any one of Claims 7 to 10. 

12. A autonomous unit of DNA replication according to 
Claim 11 which is Agrobacterium tumef aciens . 

13. A autonomous unit of DNA replication according to 
Claim 11 which is a plant virus. 

14. A transgenic plant or part thereof comprising a DNA 
sequence as defined in any one of Claims 7 to 10. 

15. A seed of a transgenic plant according to Claim 14. 

16. A product obtained from a plant according to Claim 
14. 
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ASP CB 
ASP OD1 
ASP C 
ASP N 
ASN N 
ASN CB 
ASN OD1 
ASN C 
PRO N 
PRO CA 
PRO CG 
PP.0 0 
PHE CA 
PHE CG 
PHE CD 2 
PHE CE2 
PHE C 
TYR N 
TYR CB 
TYR CD1 
TYR CD 2 
TYR CZ 
TYR C 
PHE N 
PHE CB 
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PHE CZ 
PHE 0 
ASN CA 
ASN CG 
ASN ND2 
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SER CA 
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SER 0 
ASP CA 
ASP CG 
ASP OD2 
ASP 0 
ASN CA 
ASN CG 
ASN ND2 
ASN 0 
SER CA 
SER OG 
SER 0 
TRP CA 
TRP CG 
TRP CE2 
TRP CD1 
TRP C22 
TRP CH2 
TRP 0 
ASN CA 
ASN CC 
ASN ND2 
ASN 0 
THR CA 
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2.38 
3.32 
3.49 
2.22 
2.64 



52 
74 
17 
61 



1.81 
4.04 
0.33 
0.98 
3.38 
4.30 
5.50 
-0.16 
-0.48 

-1.04 - 

-2.89 - 
-2.24 

-3.95 - 

-2.78 - 

-3.79 - 

-5.79 - 

-5.78 - 

-5.15 - 

-3.90 - 

-6.47 - 

-6.76 - 

-6.74 - 

-7.57 - 

-8.80 - 

-10.32 - 

■10.32 - 

•12.27 - 

-11.38 - 

-9.66 - 

-9.28 - 

-12.23 - 
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-19.99 - 
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PRO CB 
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2. 


73 


17 
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18.43 ■ 


PRO C 


13 


1.15 


1. 


15 


19. 


00 


18.17 


PHE N 


14 


1.52 


1. 


52 


19. 


91 


19.99 


PHE CB 


14 


2.02 


2. 


02 


20. 
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19.80 


PHE CD1 


14 


3.66 


3. 


66 


18 


58 


20.43 


PHE CE1 


14 


4.86 


4, 


86 


18 


03 


19.85 


PHE CZ 


14 


5.78 


5. 


78 


18 


66 


20.96 


PKE 0 


14 


-0.80 


-0. 


80 


21 


03 


21.74 


TYR CA 


15 


-1.51 


-1. 


51 


22. 


74 


24.00 


TYR CG 


15 
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25 
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26.45 
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17 


26 
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15 


-4.88 


-4. 
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TYR 0 


15 
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79 
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14 


21,99 


PHE CA 


16 
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13 


21 


61 


20.73 


PHE CG 


16 


-5.13 


-5 


13 


19 
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16 
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89 


19 
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PHE CE2 


16 


-3.26 


-3. 


26 


18 
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PHE C 


16 


-5.97 


-5 
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23.52 


ASN N 


17 


-6.17 


-6. 


17 


23 


16 


24.39 


ASN CB 


17 


-6.43 


-6. 


43 


24 


56 


25.93 


. ASN ODl 


17 


-6.11 


-6 


11 


26 


28 


26.78 


ASN C 


17 


-8.22 


-8 


22 


24 


31 


23.39 


SER N 


18 


-8.90 


-8 


90 


25 
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25.06 


, SER CB 


18 


-10.82 


-10 


82 


26 


02 


27.25 


SER C 


18 


-11. 09 
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09 


25 


21 


24.86 


ASP N 


19 


-10.50 


-10 


50 


25 


59 


25.67 


. ASP CB 


19 


-11.09 


-11 


09 


26 


89 


27.12 


: ASP ODl 


19 


-8.90 


-8 


90 


26 


16 


28.26 


: ASP C 


19 


-11.36 


-11 


36 


24 


38 


24.14 


: ASN N 


20 


-10.48 


-10 


48 


23 


44 
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: ASN CB 


20 


-9.90 


-9 


90 


21 
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: ASN ODl 


20 


-8.00 
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21 
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: ASN C 


20 


-10.63 


-10 


.63 


21 


00 


19.85 


: SER N 
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-10.60 


-10 
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20.06 


: SER CB 
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-9.12 


-9 
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: SER C 


21 


-11.35 


-11 
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: TRP N 
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-12.56 
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19.55 


: TRP CB 
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-14.17 
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-13.16 
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-15.38 
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: ASN CB 


23 
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18 
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: THR CB 


24 


-20.90 
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19 


.21 
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PHE CD1 26 -28.28 -27.02 16.36 : PHE CD 2 26 -27.36 -27.36 18.05 
PHE CE1 26 -29.52 -27.25 16.86 : PHE CE2 26 -28.60 -28.60 18.54 
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ILE C 
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-20. 
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-20. 
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ILE 0 33 -20.99 -27.83 18.00 : ARG N 34 -19.24 -19.24 18.89 

ARG CA 34 -18.32 -27.76 18.55 : ARG CB 34 -17.69 -17.69 19.81 

ARG CG 34 -18.74 -26.44 20.52 : ARG CD 34 -18.73 -18.73 21.99 

ARG NE 34 -17.76 -25.75 22.64 : ARG CZ 34 -16.93 -16.93 23.57 

ARG NH1 34 -16.03 -25.49 24.21 : ARG NH2 34 -16.93 -16.93 23.85 

ARC C 34 -17.21 -28.23 17.60 : ARG 0 34 -16.93 -16.93 17.53 

VAL N 35 -16.55 -27.39 16.81 : VAL CA 35 -15.51 -15.51 15.90 

VAL CB 35 -15.93 -27.51 14.48 : VAL CGI 35 -14.97 -14.97 13.57 

VAL CG2 35 -17.30 -28.05 14.14 : VAL C 35 -14.26 -14.26 16.26 

VAL 0 35 -14.39 -25.79 16.45 : LEU N 36 -13.05 -13.05 16.39 

LEU CA 36 -11.88 -26.80 16.77 : LEU CB 36 -10.59 -10.59 16.97 

LEU CG 36 -9.77 -27.74 18.26 : LEU CD1 36 -8.33 -8.33 17.85 

LEU CD2 36 -9.94 -26.65 19.25 : LEU C 36 -11.51 -11.51 15.62 

LEU 0 36 -11.85 -26.20 14.47, : GLN N 37 -10.84 -10.84 15.81 

GLN CA 37 -10.26 -23.98 14.73 : GLN CB 37 -9.65 -9.65 15.28 

GLN CG 37 -8.57 -22.82 16.32 : GLN CD 37 -7.59 -7.59 16.16 

GLN OE1 37 -7.42 -20.76 16.96 : GLN NE2 37 -6.95 -6.95 14.99 

GLN C 37 -9.19 -24.83 14.03 : GLN 0 37 -8.80 -8.80 14.54 
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ARG N 38 -8.81 -24.50 12.80 : ARG CA 38 -7,82 -7.82 12.03 

ARG CB 38 -7.59 -24.77 10.65 : ARC CG 38 -8.76 -8.76 9.82 

ARG CD 38 -8.49 -25.08 8.54 : ARG NE 38 -8.67 -8.67 7.42 

ARG CZ 38 -7.62 -23.92 6.68 : ARG NH1 38 -7.84 -7.84 5.64 

ARG NH2 38 -6.40" -24.37 6.96 : ARG C 38 -6.46 -6.46 12.64 

ARG 0 38 -6.11 -24.14 13.08 : PHE N 39 -5.63 -5.63 12.64 

PHE CA 39 -4.31 -26.18 13.23 : PHE CB 39 -3.68 -3.68 13.35 

PHE CG 39 -4.49 -28.49 14.22 : PHE GDI 39 -4,59 -4.59 15.58 

PHE CD2 39 -5.10 -29.58 13.67 : PHE CE1 39 -5.28 -5.28 16.34 

PHE CE2 39 -5.80 -30.47 14.44 : PHE CZ 39 -5.89 -5.89 15.78 

PHE C 39 -3.39 -25.31 12.41 : PHE 0 39 -2.72 -2.72 13.01 

ASP N 40 -3.28 -25.35 11.09 : ASP CA 40 -2.37 -2.37 10.35 

ASP CB 40 -2.12 -25.10 8.97 : ASP CG 40 -3.36 -3.36 8.12 

ASP 0D1 40 -4.42 -25.50 8.54 : ASP 0D2 40 -3.32 -3.32 7.05 

ASP C 40 -2.92 -23.07 10.27 : ASP 0 40 -2.17 -2.17 10.03 

GLN N 41 -4.18 -22.75 10.44 : GLN CA 41 -4.61 -4.61 10.65 

GLN CB 41 -6,09 -21.15 10.99 : GLN CG 41 -6.70 -6.70 10.03 

GLN CD 41 -6.79 -20.69 8.57 : GLN 0E1 41 -5.81 -5.81 7.87 

GLN NE2 41 -8.01 -20.84 8.01 : GLN C 41 -3.95 -3.95 11.88 

GLN 0 41 -3.75 -19.62 11.98 : GLN N 42 -3.69 -3.69 12.89 

GLN CA 42 -3.12 -21.09 14.10 : GLN CB 42 -3.40 -3.40 15.16 

GLN CG 42 -3.74 -21.17 16.29 : GLN CD 42 -3.06 -3.06 17.58 

GLN 0E1 42 -3.57 -21.23 18.66 : GLN NE2 42 -1.90 -1.90 17.48 

GLN C 42 -1.65 -20.83 14.03 : GLN 0 42 -1.15 -1.15 14.70 

SER N 43 -0.91 -21.60 13.28 : SER CA 43 0.53 0.53 13.15 

SER CB 43 1.20 -22.10 14.34 : SER OG 43 2.53 2.53 14.44 

SER C 43 0.93 -22,23 11.90 : SER 0 43 0.60 0.60 11.85 

LYS N 44 1.63 -21.73 10.89 : LYS CA 44 2.00 2.00 9.81 

LYS CB 44 2.54 -21.89 8.55 : LYS CG 44 1.57 1.57 7.30 

LYS CD 44 0.61 -23.38 7.21 : LYS CE 44 0.14 0,14 5.83 

LYS NZ 44 -0.82 -25.21 5.93 : LYS C 44 3.06 3.06 10.32 

LYS 0 44 3.42 -24.51 9.60 : ARG N 45 3.55 3.55 11.58 

ARG CA 45 4.42 -24.48 12.14 : ARG CB 45 4.86 4.86 13.55 

ARG CG 45 5.34 -22.96 13.90 : ARG CD 45 5.97 5.97 15.24 

ARG NE 45 6.53 -21.95 15.79 : ARG CZ 45 7.50 7.50 16.70 

ARG NH1 45 7.91 -20.90 17.29 : ARC NH2 45 8.14 8.14 16,95 

ARG C 45 3.58 -25.73 12.24 : ARG 0 45 4.11 4.11 12.07 

Lffl'N 46 2.27 -25.71 12.50 : LEU CA 46 1.49 1.49 12.63 

LEU CB 46 0.35 -26.72 13.61 : LEU CG 46 0.82 0.82 14.97 

LEU CD1 46 -0.32 -25.71 15.73 : LEU CD2 46 1.45 1.45 15.57 

LEU C 46 0.92 -27.25 11.29 : LEU 0 46 -0.18 -0.18 11.21 

GLN N 47 1.68 -27.00 10.23 : GLN CA 47 1.24 1.24 8.85 

GLN CB 47 2.29 -26.70 7.94 : GLN CG 47 2.40 2.40 6.62 

GLN CD 47 3.30 -26.44 5.80 : GLN 0E1 47 3.17 3.17 5.89 

GLN NE2 47 4.19 -27.04 4.99 : GLN C 47 0.90 0.90 8.48 

GLN 0 47 -0.08 -28.81 7.80 : ASN N 48 1.70 1.70 8.82 

ASN CA 48 1.32 -30.94 8.49 : ASN CB 48 2.51 2.51 8.77 

ASN CG 48 3.46 -31.75 7.59 : ASN ODl 48 4.39 4.39 7.54 

ASN ND2 48 3.35 -30.92 6.56 : ASN C 48 0.09 0.09 9.19 

ASN 0 48 -0.28 -32.65 9.00 : LEU N 49 -0.62 -0.62 9.96 

LEU CA 49 -1.82 -31.14 10.59 : LEU CB 49 -1.76 -1.76 12.01 

LEU CG 49 -1.80 -31.74 13.09 : LEU CD1 49 -1.09 -1.09 12.69 

LEU CD2 49 -1.14 -31.18 14.30 : LEU C 49 -2.87 -2.87 9.81 

LEU 0 49 -3.92 -30.23 10.38 : GLU N 50 -2.80 -2.80 8.53 

GLU CA 50 -3.82 -29.18 7.92 : GLU CB 50 -3.15 -3.15 6.80 

GLU CG 50 -3.92 -27.75 5.68 : GLU CD 50 -3.02 -3.02 4.48 

GLU 0E1 50 -3.35 -26.28 3.88 : GLU 0E2 50 -1.97 -1.97 4.19 

GLU C 50 -5.01 -29.96 7.47 : GLU 0 50 -6.14 -6.14 7.44 

ASP N 51 -4.78 -31.25 7.31 : ASP CA 51 -5.81 -5.81 6.88 
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ASP CB 51 -5.21 -33.38 6.26 
ASP 0D1 51 -4.68 -32.36 4.21 

ASP C 51 -6.64 -32.67 8.05 

TYR N 52 -6.48 -32.28 9.31 

TYR CB 52 -6.35 -33.60 11.31 

TYR CD1 52 -6.01 -35.97 10.64 

TYR CD2 52 -4.41 -34.37 10.01 

TYR CZ 52 -4.08 -36.63 9.44 

TYR C 52 -8.01 -31.94 11.14 

ARC N <53 -9.18 -32.31 11.63 

ARC CB 53 -11.20 -30.94 11.82 

ARG CD 53 -10.20 -28.99 10.71 

ARC CZ 53 -11.71 -27.11 10.92 

ARG NH2 53 -11.34 -26.97 12.18 

ARG 0 53 -10.49 -33.40 13.59 

LEU CA 54 -11.18 -32.37 15.95 

LEU CG 54 -9.16 -32.50 17.67 

LEU CD 2 54 -9.23 -33.94 18.03 

LEU 0 54 -12.77 -30.67 15.94 

VAL CA 55 -14.98 -32.07 16.45 

VAL CC1 55 -15.35 -32.84 14.04 

VAL C 55 -15.54 -32.84 17.66 

GLU N 56 -16.10 -32.14 18.67 

GLU CB 56 -15.64 -31.75 20.92 

GLU CD 56 -17.12 -30.40 22.55 

GLU 0E2 56 -17.28 -29.52 21.73 

GLU 0 56 -18.56 -31.67 19.53 

PHE CA 57 -20.12 -33.36 21.06 

PHE CG 57 -22.15 -34.85 20.30 

PHE CD2 57 -23.20 -34.13 19.74 

PHE CE2 57 -24.52 -34.54 19.80 

PHE C 57 -20.49 -33.75 22.45 

ARG N 58 -21.54 -33.24 23.07 

ARG CB 58 -21.52 -33.05 25.54 

ARG CD 58 -22.44 -32.83 27.83 

ARG CZ 58 -21.23 -32.75 29.99 

ARG NH2 58 -22.26 -33.31 30.65 

ARC 0 58 -23.97 -32.66 23.69 

SER CA 59 -25.70 -34.48 24.82 

SER OG 59 -25.92 -36.63 23.38 

SER 0 . 59 -25.56 -35.97 26.72 

LYS CA 60 -27.95 -35.01 27.84 

LYS CG 60 -28.58 -32.53 27.99 

LYS CE 60 -28.33 -32.42 30.58 

LYS C 60 -28.47 -36.48 27.77 

PRO N 61 -28.82 -37.21 28.84 

PRO CA 61 -29.53 -38.49 28.85 

PRO CG 61 -28.81 -38.18 30.92 

PRO 0 61 -31.36 -37.70 27.57 

GLU CA 62 -31.89 -40.28 26.47 

GLU CG 62 -32.93 -41.46 28.44 

GLU OE1 62 -34.25 -43.38 29.02 

GLU C 62 -32.07 -39.25 25.37 

THR N 63 -31.07 -39.08 24.51 

THR CB 63 -30.28 -36.96 24.10 

THR CC2 63 -28.84 -37.20 23.71 

THR 0 63 -30.02 -39.56 22.14 

LEU CA 64 -30.75 -38.64 19.68 

LEU CG 64 -31.85 -39.57 17.47 
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: ASP CG 51 -4.26 -4.26 5.11 

: ASP 0D2 51 -3.13 -3.13 5.12 

: ASP 0 51 -7.47 -7.47 7.84 

: TYR CA 52 -7.23 -7.23 10.37 

: TYR CC 52 -5.58 -5.58 10.63 

: TYR CE1 52 -5.26 -5.26 10.04 

: TYR CE2 52 -3.65 -3.65 9.41. 

: TYR OH 52 -3.31 -3.31 8.87 

: TYR 0 52 -7.57 -7.57 11.23 

: ARC CA 53 -9.95 -9.95 12.48 

: ARG CC 53 -10.97 -10.97 10.52 

: ARG NE 53 -11.08 -11.08 10.16 

: ARG NH1 53 -12.62 -12.62 10.40 

: ARG C 53 -10.39 -10.39 13.68 

: LEU N 54 -10.69 -10.69 14.82 

: LEU CB 54 -10.49 -10.49 17.21 

: LEU CD1 54 -8.80 -8.80 18.94 

: LEU C 54 -12.60 -12.60 16.05 

: VAL N 55 -13.67 -13.67 16.15 

: VAL CB 55 -15.94 -15.94 15.18 

: VAL CG2 55 -17.26 -17.26 15.53 

: VAL 0 55 -15.45 -15.45 17.68 

: GLU CA 56 -16.47 -16.47 19.99 

: GLU CG 56 -16.00 -16.00 22.37 

: GLU OEl 56 -17.85 -17.85 23.55 

: GLU C 56 -17.97 -17.97 20.17 

: PHE N 57 -18.66 -18.66 20.98 

: PHE CB 57 -20.69 -20.69 20.13 

: PHE GDI 57 -22.47 -22.47 20.95 

: PHE CE1 57 -23.79 -23.79 21.01 
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LEU CD2 64 -33.14 -39.04 16.91 

LEU 0 64 -30.94 -36.37 18.89 

LEU CA 65 -28.80 -36.62 17.12 

LEU CG 65 -26.49 -35.91 16.09 
LEU CD2 65 -25.06 -36.34 16.20 

LEU 0 65 -29.30 -38.24 15.47 

LEU CA 66 -30.87 -36.62 13.82 

LEU CG 66 -32.99 -35.29 14.43 
LEU CD 2 66 -33.97 -36.44 14.28 

LEU 0 66 -28.78 -36.57 12.70 

PRO CD 67 -31.65 -38.37 11.53 

PRO CB 67 -30.42 -38.92 9.64 

PRO C 67 -28.69 -37.16 9.73 

GLN N 68 -27.38 -37.32 9.57 

GLN CB 68 -25.81 -35.55 9.57 

GLN CD 68 -24.11 -35.37 11.37 

GW NE2 68 -23.63 -34.17 11.13 

GLN 0 68 -25.41 -38.41 7.97 

GLN CA 69 -23.57 -37.19 6.28 

GLN CG 69 -24.66 -37.75 4.14 

GLN 0E1 69 -24.53 -35.78 2.80 

GLN C 69 -22.47 -36.15 6.42 

ALA N 70 -21.20 -36.45 6.63 

ALA CB 70 -19.41 -35.44 7.93 

AUO 70 -18.97 -36.70 5.18 

ASP CA 71 -17.63 -34.86 3.92 

ASP CG 71 -17.03 -32.37 3.54 

ASP 0D2 71 -17.56 -32.02 4.58 

ASP 0 71 -15.28 -34.87 4.06 

ALA CA 72 -14.96 -36.17 6.35 

ALA C 72 -14.88 -37.54 6.91 

GLU N 73 -13.79 -38.27 6.96 

GLU CB 73 -12.44 -40.11 7.17 

GLU CD 73 -12.40 -42.58 6.80 

CUJ 0E2 73 -13.12 -42.37 5.77 

GLU 0 73 -13.00 -38.37 9.53 

LEU CA 74 -14.55 -39.60 11.46 

LEU CG 74 -16.55 -37.93 11.81 

LEU CD2 74 -18.02 -38.04 11.60 

LEU 0 74 -14.51 -41.85 12.05 

LEU CA 75 -12.84 -41.55 14.24 

LEU CG 75 -10.68 -42.43 15.40 

LEU CD2 75 -9.43 -42.91 14.82 

LEU 0 75 -13.13 -39.88 15.96 

LEU CA 76 -15.55 -41.26 16.86 

LEU CG 76 -18.23 -41.53 16.75 

LEU CD2 76 -18.48 -40.13 16.31 

LEU 0 76 -14.58 -43.03 17.97 

VAL CA 77 -14.18 -41.82 20.47 

VAL CGI 77 -12.27 -41.63 22.08 

VAL C 77 -15.01 -41.44 21.67 

VAL N 78 -15.37 -42.31 22.60 

VAL CB 78 -17.25 -42.95 23.93 

VAL CG2 78 -18.12 -43.15 22.73 

VAL 0 78 -14.84 -42.35 25.55 

ARG CA 79 -14.86 -39.95 26.71 

ARC CG 79 -14.02 -37.74 25.58 

ARG NE 79 -13.74 -35.32 25.50 

ARG NH1 79 -13.07 -33.09 25.61 
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: LEU CB 75 -11.35 -11.35 14.31 

: LEU CD1 75 -11.37 -11.37 15.82 

: LEU C 75 -13.49 -13.49 15.46 

: LEU N 76 -14.58 -14.58 15.82 

: LEU CB 76 -16.84 -16.84 16.40 

: LEU CD1 76 -18.44 -18.44 18,21 

: LEU C 76 -14.95 -14.95 18.10 

: VAL N 77 -14.79 -14.79 19.26 

: VAL CB 77 -12.78 -12.78 20.72 

: VAL CG2 77 -11.83 -11.83 19.84 

: VAL 0 77 -15.33 -15.33 21.80 

: VAL CA 78 -16.21 -16.21 23.68 

: VAL CGI 78 -18.19 -18.19 25.01 

: VAL C 78 -15.44 -15.44 24.94 

: ARC N 79 -15.30 -15.30 25.36 

: ARG CB 79 -14.43 -14.43 26.82 

: ARG CD 79 -14.70 -14.70 25.37 

: ARG CZ ■ 79 -13.99 -13.99 25.33 

: ARG NH2 79 -15.17 -15.17 24.82 
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ARG C 79 -16.20 -40.10 27.50 : ARC 0 79 -17.24 -17.24 26.90 

SER N 80 -16.49 -40.50 28.75 : SER CA 80 -17.91 -17.91 29.29 

SER CB 80 -18.40 -39.17 29.61 : SER OG 80 -17.56 -17.56 30.62 

SER C 80 -19.07 -41.27 28.53 : SER 0 80 -19.95 -19.95 27.93 

GLY N 81 -18.96 -42.60 28.70 : GLY CA 81 -19.64 -19.64 28.06 

GLY C 81 -21.11 -43.61 27.77 : GLY 0 81 -21.73 -21.73 28.31 

SER N 82 -21.62 -44.58 26.99 : SER CA 82 -23.00 -23.00 26.46 

SER CB 82 -23.92 -43.63 26.88 : SER OC 82 -23.45 -23.45 26.23 

SER C 82 -23.01 -44.67 24.89 : SER 0 82 -22.00 -22.00 24.31 

ALA N 83 -23.96 -44.07 24.15 : ALA CA 83 -23.96 -23.96 22.68 

ALA CB 83 -22.71 -43.47 22.06 : ALA C 83 -24.11 -24.11 21.88 

ALA 0 83 -23.33 -46.28 21.95 : ILE N 84 .25.16 -25.16 21.09 

ILE CA 84 -25.51 -46.45 20.19 : ILE CB 84 -26.99 -26.99 20.33 

ILE CG2 84 -27.43 -47.70 19.14 : ILE CGI 84 -27.19 -27.19 21.63 

ILE CD1 84 -28.67 -47.63 22.03 : ILE C 84 -25.32 -25.32 18.87 

ILE 0 84 -25.98 -44.71 18.65 : LEU N 85 -24.51 -24.51 17.94 

LEU CA 85 -24.36 -45.46 16.70 : LEU CB 85 -22.90 -22.90 16.51 

LEU CG 85 -22.27 -44.83 15.24 : LEU GDI 85 -22.67 -22.67 14.86 

LEU CD2 85 -20.78 -45.01 15.45 : LEU C 85 -24.93 -24.93 15.64 

LEU 0 85 -24.79 -47.57 15.74 : VAL N 86 -25.60 -25.60 14.65 

VAL CA 86 -26.00 -46.71 13.56 : VAL CB 86 -27.56 -27.56 13.57 

VAL CGI 86 -28.28 -46.04 14.42 : VAL CG2 86 -28.15 -28.15 12.19 

VAL C 86 -25.55 -46.05 12.27 : VAL 0 86 -25.74 -25.74 12.14 

LEU N 87 -24.88 -46.77 11.38 : LEU CA 87 -24.55 -24.55 10.11 

LEU CB 87 -23.25 -46.65 9.61 : LEU CG 87 -22.11 -22.11 10.54 

LEU CD1 87 -20.93 -47.07 9.70 : LEU CD2 87 -21.85 -21.85 11.28 

LEU C 87 -25.66 -46.73 9.22 : LEU 0 87 -26.00 -26.00 9.39 

VAL N 88 -26.30 -45.98 8.31 : VAL CA 88 -27.35 -27.35 7.46 

VAL CB 88 -28.47 -45.49 7.41 : VAL CGI 38 -29.45 -29.45 6.38 

VAL CG2 88 -29.12 -45.33 8.76 : VAL C 88 -26.69 -26.69 6.11 

VAL 0 88 -26.15 -45.62 5.66 : LYS N 89 -26.65 -26.65 5.38 

LYS CA 89 -25.94 -47.80 4.10 : LYS CB 89 -25.02 -25.02 4.15 

LYS CG 89 -23.71 -48.54 4.77 : LYS CD 89 -22.73 -22.73 5.27 

LYS CE 89 -21.54 -48.85 5.96 : LYS NZ 89 -20.87 -20.87 7.05 

LYS C 89 -26.98 -47.86 2.97 : LYS 0 89 -28.07 -28.07 3.24 

PRO N 90 -26.80 -47.35 1.74 : PRO CD 90 -25.54 -25.54 1.17 

PRO CA 90 -27.81 -47.17 0.73 : PRO CB 90 -27.18 -27.18 -0.39 

PRO CG 90 -25.82 -46.96 -0.33 : PRO C 90 -28.36 -28.36 0.26 

PRO 0 90 -29.58 -48.64 0.08 : ASP N 91 -27.55 -27.55 0.14 

ASP CA 91 -28.01 -50.83 -0.22 : ASP CB 91 -26.81 -26.81 -0.70 

ASP CG 91 -25.71 -51.69 0.33 : ASP ODl 91 -25.45 -25.45 1.04 

ASP 0D2 91 -25.10 -52.76 0.44 : ASP C 91 -28.68 -28.68 0.95 

ASP 0 91 -28.25 -52.60 1.40 : ASP N 92 -29.73 -29.73 1.52 

ASP CA 92 -30.47 -51.44 2.66 : ASP CB 92 -31.47 -31.47 2.22 

ASP CG 92 -32.80 -52.49 2.95 : ASP ODl 92 -33.66 -33.66 2.43 

ASP 0D2 92 -33.06 -51.80 3.97 : ASP C 92 -29.68 -29.68 3.77 

ASP 0 92 -30.30 -52.74 4.60 : ARG N 93 -28.36 -28.36 3.91 

ARG CA 93 -27.71 -52.49 5.11 ; ARG CB 93 -26.24 -26,24 4.79 

ARG CG 93 -25.90 -53. 9i 3.68 : ARC CD 93 -24.33 -24.33 3.58 

ARG NE 93 -23.80 -54.65 4.89 : ARG CZ 93 -23.11 -23.11 5.89 

ARG NH1 93 -22.86 -54.55 7.10 : ARC NH2 93 -22.59 -22.59 5.67 

ARC C 93 -27.82 -51.43 6.22 : ARG 0 93 -28.34 -28.34 6.03 

ARC N 94 -27.38 -51.77 7.43 : ARG CA 94 -27.28 -27.28 8.64 

ARG CB 94 -28.49 -51.02 9.53 : ARG CG 94 -29.32 -29.32 9.54 

ARG CD 94 -30.26 -49.70 8.35 : ARG NE 94 -31.09 -31.09 8.24 

ARG CZ 94 -31.62 -51.29 7.06 : ARG NH1 94 -32.34 -32.34 6.95 

ARG NH2 94 -31.49 -50.56 5.96 : ARG C 94 -26.14 -26.14 9.46 

ARG 0 94 -25.65 -52.66 9.15 : CLU N 95 -25.69 -25.69 10.48 

GLU CA 95 -24.63 -51.38 11.31 : CLU CB 95 -23.30 -23.30 10.84 
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GLU CG 95 -22.38 -52.01 10.37 

GLU OE1 95 -20.79 -51.95 8.57 

GLU C 95 -24.95 -50.70 12.61 

TYR N 96 -24.98 -51.44 13.73 

TYR CB 96 -26.41 -51.68 15.76 

TYR GDI 96 -28.61 -50.70 15.19 

TYR CD 2 96 -27.76 -52.44 13.82 

TYR CZ 96 -29.76 -51.34 13.24 

TYR C 96 -24.06 -51.07 15,88 

PHE N 97 -23.49 -50.03 16.46 

PHE CB 97 -21.24 -49.51 17.00 

PHE CD1 97 -21.31 -49.23 14.59 

PHE CE1 97 -20.82 -49.51 13.36 

PHE CZ 97 -19.81 -50.43 13.20 

PHE 0 97 -23.58 -48.66 18.77 

PHE CA 98 -22.81 -49.88 21.13 

PHE CG 98 -23.71 -50.44 23.39 

PHE CD2 98 -22.85 -50; 81 24.41 

PHE CE2 98 -23.04 -50.38 25.70 

PHE C 98 -21.37 -49.65 21.57 

LEU N 99 -21.13 -48.36 21.70 

LEU CB 99 -19.68 -46.53 21.20 

LEU CD1 99 -19.04 -47.67 19.14 

LEU C 99 -19.93 -47.49 23.54 

THR N 100 -18.86 -47.28 24.29 

THR CB 100 -19.58 -47.78 26.62 

THR CG2 100 -18.88 -49.07 27.01 

THR 0 100 -16.60 -47.53 25.72 

SER CA 101 -15.75 -45.57 27.49 

SER OG 101 -16.16 -44.81 29.79 

SER 0 101 -15.63 -47.76 28.43 

ASP CA 102 -12.69 -47.42 28.83 

ASP CG 102 -13.60 -47.61 31.27 

ASP OD2 102 -13.01 -47.94 32.32 

ASP 0 102 -12.99 -49.56 27.67 

ASN CA 103 -12.92 -51.11 30.03 

ASN CG 103 -13.56 -50.80 32.49 

ASN ND2 103 -14.73 -50.15 32.37 

ASH 0 103 -15.33 -50.69 29.83 

PRO CD 104 -15.18 -53.63 30.64 

PRO CB 104 -13.35 -54.64 29.25 

PRO C 104 -15.33 -54.33 27.65 

ILE N 105-15.07-55.33 26.75 

ILE CB 105 -16.66 -57.27 26.63 

ILE CGI 105 -17.66 -58.03 25.68 

ILE C 105 -17.21 -55.05 25.35 

PHE N 106-16.74-53.92 24.75' 

PHE CB 106 -18.48 -52.14 25.06 

PHE CD1 106 -20.28 -53.69 24.14 

PHE CE1 106 -21.47 -54.41 24.27 

PHE CZ 106 -22.21 -54.28 25.44 

PHE 0 106-17.52-51.66 22.05 

SER CA 107 -15.09 -50.38 22.50 

SER OG 107 -14.04 -50.06 20.37 

SER 0 107 -16.90 -48.77 22.83 

ASP CA 108 -14.95 -46.76 22.94 

ASP CG 108 -12.55 -46.26 23.80 

ASP 0D2 108 -12.00 -46.87 22.86 

ASP 0 108 -14.81 -44.84 21.62 
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8|!L 



JSTJ-8. 

HIS CA 109 -14.47 -45.96 19.31 
HIS CG 109 -12.21 -46.61 19.00 
HIS ND1 109 -11.80 -47.13 20.13 
HIS NE2 109 -10.81 -48.11 18.52 
HIS 0 109 -15.10 -47.83 18.08 
GLN CA 110 -15.62 -46.48 15.78 
GLN CG 110 -17.80 -47.48 14.97 
GLN 0E1 110 -16.83 -48.73 13.20 
GLN C 110 -15.33 -45.49 14.67 
LYS N 111 -15.06 -45.99 13.48 
LYS CB 111 -13.62 -45.88 11.64 
LYS CD 111 -12.32 -46.31 9.49 
LYS NZ 111 -9.93 -47.28 9.70 
LYS 0 111 -16.47 -46.07 10.95 
ILE CA 112 -17.76 -43.75 10.51 
ILE CG2 112 -19.90 -42.58 10.27 
ILE GDI 112 -19.66 -41.59 13.03 
ILE 0 112 -16.49 -42.41 9.10 
PRO CD 113 -18.03 -45.26 7.79 
PRO CB 113 -16.58 -44.96 6.01 
PRO C 113 -16.94 -42.47 6.08 
ALA N 114 -16.15 -41.73 5.35 
ALA CB 114 -15.47 -39.96 3.92 
ALA 0 114 -17.91 -40.47 2.70 
GLY CA 115 -20.23 -39.80 3.73 
GLY 0 115 -22.33 -40.62 3.36 
THR CA 116 -22.15 -42.66 5.54 
THR 0G1 116 -20.35 -44.09 6.13 
THR C 116 -23.11 -41.88 6.39 
ILE N 117 -24.40 -42.15 6.31 
ILE CB 117 -26.70 -41.56 6.42 
ILE CGI 117 -26.83 -40.39 5.47 
ILE C 117 -25.33 -42.10 8.52 
PHE N 118 -25.40 -41.42 9.65 
FHE CB 118 -23.88 -42.14 11.46 
PHE CD1 118 -23.21 -40.33 13.06 
PHE CE1 118 -22.61 -39.14 13.35 
PHE CZ 118 -21.94 -38:47 12.38 
PHE 0 118 -26'. 40 -40.18 11.74 
TYR CA 119 -27.52 -41.17 13.89 
TYR CG 119 -29.81 -42.53 13.49 
TYR CE1 119 -30.40 -44.23 12.00 
TYR CE2 ll9 -31.15 -44.38 14.27 
TYR OH 119 -31.57 -46.13 12.70 
TYR 0 119 -26.45 -42.78 15.23 
LEU CA 120 -26.42 -41.44 17.53 
LEU CG 120 -23.86 -40.75 17.90 
LEU CD2 120 -23.50 -41.32 19.23 
LEU 0 120 -28.22 -40.39 18.55 
VAL CA 121 -28.66 -41.97 20.61 
VAL CGI 121 -30.11 -43.46 19.21 
VAL C 121 -27.88 -42.08 21.90 
ASN N 122 -28.22 -41.40 23.00 
ASN CB 122 -27.36 -40.18 25.06 
ASN OD1 122 -26.35 -41.42 26.91 
ASN C 122 -28.42 -42.42 25.04 
ARG N 123 -27.98 -43.66 25.22 
ARC CB 123 -28.18 -45.97 25.51 
ARC CD 123 -26.34 -46.96 27.34 
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: LEU C 120 -27.47 -27.47 18.62 

: VAL N 121 -27.70 -27.70 19.56 

: VAL CB. 121 -29.94 -29.94 20.60 

: VAL CG2 121 -29.89 -29.89 21.59 

: VAL 0 121 -26.85 -26.85 21.88 

: ASN CA 122 -27.49 -27.49 24.27 

: ASN CG 122 -26.75 -26.75 26.48 

; ASN ND2 122 -26.65 -26.65 27.33 

: ASN 0 122 -29.52 -29.52 25.43 

: ARG CA 123 -28.84 -28.84 25.72 

: ARG CG 123 -26.73 -26.78 26.17 

: ARG NE 123 -26.83 -26.83 28.73 



WO 91/04270 

9/32 

Jnf. t-9. 

ARG CZ 123 -27.92 -47.53 29.20 

ARG NH2 L23 -28.69 -48.33 28.35 

ARG 0 123 -29.99 -45.29 27.68 

ASP CA 124 -28.41 -43.82 29.32 

ASP CG 124 -26.82 -43.92 31.10 

ASP 0D2 124 -26.50 -45.09 31.31 

ASP 0 124 -29.39 -41.67 29.34 

PRO CD 125 -30.08 -44.21 31.82 

PRO CB 125 -32.18 -43.01 31.95 

PRO C 125 -30.60 -41.19 32.61 

LYS N 126 -29.29 -41.29 32.93 

LYS CB 126 -28.84 -41.09 35.28 

LYS CD 126 -26.81 -42.73 35.39 

LYS NZ 126 -25.25 -42.56 37.35 

LYS 0 126 -27.33 -38.50 33.61 

GUI CA 127 -25.14 -39.81 32.71 

GLU CG 127 -23.20 -40.68 34.17 

GLU OEl 127 -24.84 -39.90 35.86 

GLU C 127 -25.15 -39.17 31.28 

ASP N 128 -24.51 -38.01 30.97 

ASP CB 128 -23.82 -36.04 29.56 

ASP OD1 128 -24.75 -33.82 29.78 

ASP C 128 -23.55 -38.30 28.66 

LEU N 129 -23.48 -37.96 27.37 

LEU CB 129 -23.33 -38.90 25.07 

LEU CD1 129 -23.13 -39.52 22.75 

LEU C 129 -21.54 -37.63 26.09 

ARC N 130 -20.24 -37.98 26.09 

ARG CB 130 -18.44 -36.57 26.87 

ARG CD 130 -18.43 -34.73 28.70 

ARG CZ 130 -16.05 -34.80 29.39 

ARG NH2 130 -15.64 -33.58 28.94 

ARG 0 130 -17.39 -38.20 24.75 

ILE CA 131 -17.68 -37.73 22.17 

ILE CG2 131 -17.86 -38.40 19.73 

ILE CD1 131 -20.52 -39.68 20.45 

ILE 0 131 -16.80 -35.58 21.98 

ILE CA 132 -14.31 -36.36 20.82 

ILE CG2 132 -12.38 -37.80 21.79 

ILE CD1 132 -12.16 -34.09 21.44 

ILE 0 132 -14.06 -38.21 19.37 

GLN CA 133 -13.61 -37.13 17.05 

GLN CG 133 -16.08 -37.06 16.81 

GLN OEl 133 -17.14 -37.00 14.66 

GLN C 133 -12.46 -36.54 16.31 

LEU N 134 -11.69 -37.35 15.61 

LEU CB 134 -9.42 -37.67 14.78 

LEU CD1 134 -7.62 -36.10 14.11 

LEU C 134 -11.35 -36.91 13.42 

ALA N 135 -11.48 -35.87 12.63 

ALA CB 135 -13.27 -34.89 11.39 

ALA 0 135 -10.30 -34.77 10.58 

MET CA 136 -10.17 -36.01 8.14 

MET CG 136 -8.72 -37.84 9.12 

MET CE 136 -10.20 -39.92 10.67 

MET 0 136 -11.64 -36.43 6.36 

PRO CD 137 -10.68 -33.11 7.14 

PRO CB 137 -12.24 -32.44 5.61 

PRO C 137 -11.66 -34.38 4.13 
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: ARG NH1 123 -28.26 -28.26 30.51 

• ARC C 123 -29.16 -29.16 27.17 

• ASP N 124 -28.41 -28.41 27.89 
: ASP CB 124 -27.00 -27.00 29.66 

• ASP ODl 124 -27.09-27.09 31.98 
: ASP C 124 -29.33 -29.33 29.92 



PRO 


N 


125 


-30 


.06 


-30.06 


31.04 


PRO 


CA 


125 


-31 


.11 


-31.11 


31 


.51 


PRO 


CG 


125 


-31 


.31 


-31.31 


32 


.70 


PRO 


0 


125 


•31.38 


-31.38 


33 


.15 


LYS 


CA 


126 


-28 


.78 


-28.78 


33 


.91 


LYS 


CG 


126 


-27 


.64 


-27.64 


36 


.08 


LYS 


CE 


126 


-25 


.85 


-25.85 


36 


.36 


LYS 


C 


126 


-27 


.42 


-27.42 


33 


.54 


GLU N 


127 


-26.40 


-26.40 


33.07 


GLU 


CB 


127 


-24 


.13 


-24.13 


32 


.94 


GLU 


CD 


127 


-23 


.80 


-23.80 


35 


.61 


GLU 


OE2 


127 


-23 


.27 


-23.27 


36 


.56 



: GUI 0 127 -25.88 -25.88 30.39 

: ASP CA 128 -24.42-24.42 29.59 

• ASP CG 128 -24.72 -24.72 30.18 
: ASP 0D2 128 -25.42 -25.42 .31.15 
: ASP 0 128 -22.91 -22.91 29.09 
: LEU CA 129 -22.63 -22.63 26.39 

• LEU CG 129 -22.98 -22.98 24.13 
: LEU CD2 129 -21.59 -21.59 24.30 
: LEU 0 129 -21.95 -21.95 25.88 
: ARG CA 130 -19.26 -19.26 25.66 

• ARC CG 130 -19.22 -19.22 27.86 
: ARG NE 130 -17.26 -17.26 29.25 
: ARG NH1 130 -15.10 -15.10 29.95 

• ARG C 130 -18.36 -18.36 24.52 
: ILE N 131 -18.52 -18.52 23.24 
: ILE CB 131 -18.61 -18.61 20.99 
: ILE CGI 131 -19.50 -19.50 21.44 
: ILE C 131 -16.58 -16.58 21.79 
: ILE N 132 -15.42 -15.42 21.31 
: ILE CB 132 -13.06 -13.06 21.79 
: ILE CGI 132 -11.95 -11.95 21.26 

• ILE C 132 -14.02-14.02 19.44 
: GLN N 133 -13.77 -13.77 18.28 

• GLN CB 133 -14.76 -14.76 16.13 
: GLN CD 133 -17.25 -17.25 15.86 

• GLN NE2 133 -18.47 -18.47 16.28 
: CUH 0 133 -12.25 -12.25 16.42 
: LEU CA 134 -10.65 -10.65 14.76 
: LEU CG 134 -8.39 -8.39 13.77 
: LEU CD2 134 -7.47 -7.47 13.73 

• LEU 0 134 -11.86 -11.86 13.10 
: ALA CA 135 -12.19 -12.19 11.40 
: ALA C 135 -11.15 -11.15 10.34 
: MET N 136 -11.13 -11.13 9.19 
: MET CB 136 -9.42 -9.42 7.85 
: MET SD 136 -8.72 -8.72 9.74 
: MET C 136 -11.01 -11.01 6.95 
: PRO N 137 -11.22 -11.22 6.55 
: PRO CA 137 -12.14 -12.14 5.49 
: PRO CG 137 -11.86 -11.86 7.02 
: PRO 0 137 -10.47 -10.47 3.85 



1 
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JZZj. /-ft 

VAX. N 138 -12.58 -34.81 3.28 

VAL CB 138 -13.48 -36.17 1.52 

VAL CC2 138 -13.41 -37.40 2.33 

VAL 0 138 -11.28 -34.37 0.02 

ASN CA 139 -12.74 -31.94 0.07 

ASN CG 139 -15.11 -32.24 -0.64 

ASN ND2 139 -14.75 -32.85 -1.75 

ASN 0 139 -11.19 -30.37 -0.93 

ASN CA 140 -10.74 -29.01 1.55 

ASN CG 140 -12.71 -27.61 1.05 

ASN ND2 140 -13.44 -27.87 0.00 

ASN 0 140 -11.72 -29.44 3.54 

PRO CD 141 -8.90 -27.39 3.54 

PRO CB 141 -8.61 -27.99 5.78 

PRO C 141 -11.01 -28.39 6.27 

GLN N 142 -12.13 -27.94 5.73 

GLN CB 142 -14.15 -26.64 5.77 

GLN CD 142 -13.43 -24.42 5.61 

GLN NE2 142 -12.42 -23.91 6.35 

GLN 0 142 -14.32 -29.46 5.42 

ILE CA 143 -15.73 -30.31 7.60 

ILE CG2 143 -15.41 -30.26 10.15 

ILE CD1 143 -16.22 -33.21 10.01 

ILE 0 143 -17.39 -28.61 7.99 

HIS CA 144 -19.34 -29.88 6.45 

HIS CG 144 -18.53 -28.73 4.36 

HIS ND1 144 -18.00 -27.57 4.68 

HIS NE2 144 -16.95 -28.35 3.06 

HIS 0 144 -20.12 -32.15 6.57 

GLU CA 145 -22.39 -31.56 7.97 

GLU CG 145 -21.61 -32.31 9.92 

GLU OEl 145 -20.21 -32.02 11.80 

GLU C 145 -23.67 -31.35 7.24 

PHE N 146 -24.10 -32.39 6.57 

PHE CB 146 -25.09 -32.99 4.51 

PHE CD1 146 -22.83 -32.34 3.63 

PHE CE1 146 -22.04 -31.65 2.77 

PHE CZ 146 -22.61 -30.85 1.83 

PHE 0 146 -26.43 -34.16 6.66 

PHE CA 147 -28.25 -32.85 8.20 

PHE CG 147 -27.96 -32.05 10.55 

PHE CD2 147 -28.03 -33.19 11.31 

PHE CE2 147 -27.23 -33.35 12.42 

PHE C 147 -29.38 -33.14 7.26 

LEU N 148 -29.97 -34.33 7.29 

LEO CB 148 -31.49 -36.11 6.51 

LEU CD1 148 -32.43 -38.14 5.53 

LEU C 148 -32.38 -33.95 7.01 

SER N 149 -32.37 -33.56 8.29 

SER CB 149 -33.30 -33.21 10.40 

SER C 149 -34.53 -31.98 8.70 

SER N 150 -34.58 -30.69 8.39 

SER CB 150 -37.05 -30.34 7.76 

SER C 150 -36.32 -29.80 10.00 

THR N 151 -35.99 -28.67 10.62 

THR CB 151 -34.87 -28.95 12.85 

THR CG2 151 -34.74 -30.43 13.15 

THR 0 151 -37.00 -27.46 13.80 

GLU CA 152 -36.43 -24.83 12.73 
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11/52 

JBf. /'//. 

GLU CG 152 -38.53 -25.06 14.54 

GLU OE1 152 -38.86 -24.28 16.80 

GLU C 152 -35.43 -24.88 13.88 

AIA N 153 -34.29 -24.55 13.27 

ALA CB 153 -32.52 -25.77 14.62 

AIA 0 153 -31.36 -23.84 12.22 

GLN CA 154 -31.77 -25.92 10.51 

GLN CG 154 -30.97 -28.24 11.41 

GLN OE1 154 -32.46 -29.28 9.86 

GLN C 154 -32.29 -26.35 9.15 

GLN N 155 -33.50 -26.68 8.69 

GLN CB 155 -33.41 -25,57 6.45 

GLN CD 155 -34.59 -24.56 4.35 

GUN NE2 155 -34.53 -24.23 3.04 

GLN 0 155 -31.92 -28.10 6.31 

SER CA 156 -33.32 -30.25 5.33 

SER OG 156 -34.05 -32.24 4.10 

SER 0 156 -33.19 -29,19 3.24 

TYR CA 157 -30.76 -30.07 2.49 

TYR CG 157 -29.27 -32.19 2.37 

TYR CE1 157 -28.72 -34.17 3.53 

TYR CE2 157 -29.51 -34.25 1.30 

TYR OH 157 -29.03 -36.25 2.43 

TYR 0 157 -31.46 -29.87 0.24 

LEU CA 158 -33.60 -31.62 0.34 

LEU CC 158 -34.74 -34.06 0.44 

LEU CD2 158 -34.18 -34.87 1.58 

LEU 0 158 -34.67 -30.20 -1.28 

GLN CA 159 -35.19 -28.14 0.45 

GLN CG 159 -36.42 -28.08 2.55 

GLN OE1 159 -36.07 -27.90 4.93 

GLN C 159 -34.41 -27.27 -0.51 

GLU N 160 -33.14 -27.46 -0.84 

GLU CB 160 -31.04 -26.48 -1.45 

GLU CD 160 -31.52 -24.58 0.25 

GLU OE2 160 -31.65 -23.66 -0.59 

GLU 0 160 -32.28 -26.13* -4.15 

PHE CA 161 -33.52 -28.52 -4.81 

PHE CG 161 -32.49 -30.74 -4.67 

PHE CD2 161 -31.58 -30.60 -5.67 

PHE CE2 161 -30.39 -31.25 -5.56 

PHE C 161 -34.73 -27.77 -5.30 

SER N 162 -34.89 -27.59 -6.60 

SER CB 162 -35.61 -26.30 -8.44 

SER C 162 -37.31 -27.46 -6.94 

LYS N 163 -38.45 -26.84 -7.21 

LYS CB 163 -40.84 -26.56 -7.22 

LYS CD 163 -43.08 -25.68 -6.74 

LYS NZ 163 -44.10 -27.66 -8.04 

LYS 0 163 -40.17 -29.76 -7.52 

HIS CA 164 -39.92 -29:49 -10.23 

HIS CG 164 -39.24 -30.07 -12.73 

HIS ND1 164 -38.45 -31.19 -12.61 

HIS NE2 164 -39.01 -31.07 -14.68 

HIS 0 164 -39.39 -31.79 -10.14 

ILE CA 165 -36.87 -31.57 -9.33 

ILE CG2 165 -34.53 -31.98 -8.41 

ILE CD1 165 -33.58 -30.40 -10.81 

ILE 0 165 -37.41 -33.62 -8.28 
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: LEU CD1 166 -36.66 -36.66 -3.74 

: LEU C 166 -39.74 -39.74 -6.38 

: GLU N 167 -40.65 -40.65 -7.09 

: GLU CB 167 -42.78 -42.78 -8.24 

: GLU CD 167 -44.52 -44.52 -8.19 

: GLU 0E2 167 -45.55 -45.55 -7.60 

: GLU 0 167 -42.08 -42.08 -8.22 

: ALA CA 168 -40.46 -40.46 -10.34 

: ALA C 168 -39.72 -39.72 -9.65 

: SER N 169 -38.77 -38.77 -8.78 

: SER CB 169 -37.07 -37.07 -7.08 

: SER C 169 -39.05 -39.05 -7.11 

: PHE N 170 -39.72 -39.72 -6.14 

: PHE CB 170 -40.81 -40.81 -3.95 

: PHE GDI 170 -38.85-38.85 -2.68 

: PHE CE1 170 -37.67 -37.67 -2.01 

: PHE CZ 170 -37.17 -37.17 -1.80 

: PHE 0 170 -42.70 -42.70 -5.20 

: ASN CA 171 -43.44 -43.44 -7.90 

: ASN CG 171 -44.74 -44.74 -9.18 

: ASN ND2 171 -45.21 -45.21 -9.32 

ASN 0 171 -45.26 -45.26 -6.40 

SER CA 172 -45.90 -45.90 -5.98 

SER OG 172 -45.71 -45.71 -3.99 

SER 0 172 -45.29 -45.29 -6.91 

LYS CA 173 -47.49 -47.49 -6.01 

LYS CG 173 -49.45 -49.45 -7.51 

LYS CE 173 -48.12 -48.12 -9.70 

LYS C 173 -46.74 -46.74 -4.89 

PHE N 174 -46.22 -46.22 -5.17 

PHE CB 174 -44.62 -44.62 -4.82 

PHE CD1 174 -42.48 -42.48 -3.41 

PHE CE1 174 -41.73 -41.73 -2.34 

PHE CZ 174 -42.14 -42.14 -1.65 

PHE 0 174 -45.15 -45.15 -1.85 

GLU CA 175 -47.58 -47.58 -1.48 

GLU CG 175 -50.07 -50.07 -2.47 

GLU OEl 175 -49.33 -49.33 -3.67 

GLU C 175 -47.78 -47.78 -0.88 

GLU N 176 -47.91 -47.91 -1.48 

GLU CB 176 -48.39 -48.39 -1.56 

GLU CD 176 -48.70 -48.70 0.07 

GLU 0E2 176 -48.73 -48.73 0.14 

GLU 0 176 -46.47 -46.47 1.19 

ILE CA 177 -44.04 -44.04 -0.46 

ILE CG2 177 -41.64 -41.64 -1.10 

ILE CD1 177 -42.17 -42.17 -3.84 

ILE 0 177 -43.05 -43.05 1.61 

ASN CA 178 -43.94 -43.94 1.71 

ASN CG 178 -43.69 -43.69 1.96 

ASN ND2 178 -43.77 -43.77 3.30 

ASN 0 178 -44.02 -44.02 4.14 

ARG CA 179 -46.74 -46.74 4.23 

ARG CG 179 -49.14 -49.14 4.89 

ARG NE 179 -50,02 -50.02 3.53 

ARG NH1 179 -49.70 -49.70 3.82 

ARC C . 179 -46.15 -46.15 4.82 

VAL N 180 -45.89 -45.89 4.03 
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JST. t% 

ILE CA 195 -36.53 -39.25 22.68 

ILE CG2 195 -36.41 -39.57 20.41 

ILE CD1 195 -34.46 -41.71 23.06 

ILE 0 195 -36.34 -39.35 25.07 

ASP CA 196 -37.98 -41.53 25.27 

ASP CG 196 -40.08 -42.81 25.97 

ASP 0D2 196 -40.21 -43.76 25.15 

ASP 0 196 -37.40 -43.83 24.50 

SER CA 197 -36.86 -44.22 27.41 

SER OC 197 -37.60 -42.68 29.18 

SER 0 197 -36.40 -46.33 26.38 

CLU CA 198 -39.08 -46.78 25.81 

GLD CG 198 -41.35 -46.78 27.01 

CLU 0E1 198 -41.91 -44.98 28.52 

GLD C 198 -38.45 -47.06 24,42 

GLN N 199 -38.43 -46.12 23.43 

GLN CB 199 -37.74 -45.11 21.28 

GLN CD 199 -39.18 -42.89 20.83 

GLN NE2 199 -39.59 -42.26 19.71 

GLN 0 199 -36.07 -48.15 21.75 

ILE CA 200 -34.13 -46.81 23.22 

ILE CG2 200 -31.99 -46.51 24.49 

ILE CD1 200 -33.59 -43.99 25.72 

ILE 0 200 -33.10 -48.96 22.80 

LYS CA 201 -34.70 -50.22 24.82 

LYS CG 201 -35.62 -52.09 26.34 

LYS CE 201 -36.35 -53.82 28.11 

LYS C 201 -34.53 -51.28 23.74 

GLU N 202 -35.49 -51.25 22.78 

GLD CB 202 -36.76 -52.38 21.01 

GLU CD 202 -36.64 -55.03 20.75 

CLU 0E2 202 -37.47 -55.63 21.48 

GLU 0 202 -33.33 -53.26 21.13 

LED CA 203 -32.64 -50.63 19.98 

LED CG 203 -31.99 -48.42 18.48 

LED CD2 203 -30.92 -49.27 17.86 

LED 0 203 -30.42 -51.45 20.23 

SER CA 204 -30.06 -50.93 22.91 

SER OG 204 -30.72 -49.47 24.62 

SER 0 204 -28.43 -52.69 22.61 

LYS CA 205 -30.57 -54.59 23.03 

LYS CG 205 -32.16 -56.55 23.41 

LYS CE 205 -34.57 -55.86 24.09 

LYS C 205 -29.80 -55.13 21.83 

HIS N 206 -30.44 -54.82 20.69 

HIS CB 206 -30.87 -54.72 18.31 

HIS CD2 206 -30.02 -56.52 16.69 

HIS CE1 206 -29.55 -55.30 14.97 

HIS C 206 -28.46 -54.70 19.19 

ALA N 207 -28.16 -53.42 19.47 

ALA CB 207 -26.83 -51.46 19.86 

ALA 0 207 -24.74 -54.22 19.60 

LYS CA 208 -25.19 -54.66 22.33 

LYS CG 208 -25.67 -53.54 24.54 

LYS CE 208 -26.62 -52.92 26.82 

LYS C 208 -24.74 -56.04 21.89 

SER N 209 -25.75 -56.67 21.30 

SER CB 209 -26.86 -58.46 20.24 

SER C 209 -24.55 -57.92 19.65 
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: GLN 0E1 199 -38.88 -38.88 21.83 
: GLN C 199 -36.38 -36.38 22.25 
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SER N 210 -24.84 -56.91 18.81 : SER CA 210 -24.14 -24.14 17.55 
SER CB 210 -24.79 -55.56 16.74 : SER 0G 210 -26.24 -26.24 16.62 
SER C 210 -22.69 -56.40 17.84 ; SER 0 210 -21.85 -21.85 17.14 
SER N 211 -22.43 -55.62 18.90 : SER CA 211 -21.06 -21.06 19.26 
SER CB 211 -21.18 -53.82 19.82 : SER OG 211 -22.03 -22.03 19.04 
SER C 211 -20.55 -56.28 20.30 : SER 0 211 -21.06 -21.06 20,27 
ARG N 212 -19.64 -55.88 21.24 : ARG CA 212 -19.04 -19.04 22.40 
ARG CB 212 -19.73 -57.98 22.87 : ARG CG 212 -21.16 -21.16 23.41 
ARG CD 212 -21.77 -59.17 22.72 : ARG NE 212 -23.25 -23.25 22.62 
ARG CZ 212 -24.10 -59.38 23.70 : ARG NH1 212 -25.47 -25.47 23 54 
ARG NH2 212 -23.53 -59.40 24.98 : ARG C 212 -17.62 -17.62 22 00 
ARG OT1 212 -16.65 -56.67 22.70 : ARG OT2 212 -17.56 -17.56 21.00 
ASN CB 220 -3.63 -53.54 26.53 : ASN CG 220 -4.61 -4.61 27.75 
ASN ODl 220 -5.39 -52.66 28.00 : ASN ND2 220 -4.60 -4.60 28.56 
ASN C 220 -2.45 -51.28 26.97 : ASN 0 220 -1.34 -1.34 26.60 
ASN N 220 -2.82 -52.06 24.65 : ASN CA 220 -3.41 -3.41 25.99 
THR N 221 -2.92-51.16 26.19 : THR CA 221 -2.36 -2.36 29.22 
THR CB 221 -3.37 -50,58 30.35 : THR OGl 221 -4.72 -4.72 29.83 
THR CG2 221 -3.28 -49.38 31.32 : THR C 221 -0.90 -0.90 29.67 
THR 0 221 -0.28 -51.48 29.55 : ILE N 222 -0.28 -0.28 30.16 
ILE CA 222 1.02 -49.41 30.76 : ILE CB 222 2.12 2.12 29.77 
ILE CG2 222 2.01 -47.64 29.26 : ILE CGI 222 3.44 3.44 30.50 
ILE CD1 222 4.70 -48.79 29.65 : ILE C 222 0.98 0.98 31.85 
ILE 0 222 0.40 -47.29 31.64 : GLY N 223 1.49 1.49 33^04 
GLY CA 223 1.38 -47.40 34.00 : GLY C 223 1.18 1.18 35 40 
GLY 0 223 0.89 -49.16 35.54 : ASN N 224 1.23 1.23 36.42 
ASN CA 224 1.45 -47.62 37.75 : ASN CB 224 2.90 2.90 38.00 
ASN CG 224 3.38 -45.94 38.17 ; ASN ODl 224 2.60 2.60 38 26 
ASN ND2 224 4.71 -45.80 38.28 : ASN C 224 0.57 0.57 38.76 
ASN 0 224 -0.55 -46.64 38.42 : GLU N 225 1.00 1.00 39.97 
GLU CA 225 0.17 -46.01 40.98 : GLU CB 225 0.87 0.87 42.36 
GLU CG 225 1.73 -47.04 42.87 : GLU CD 225 3.18 3.18 42.26 
GLU OEl 225 3.37 -47.69 41.13 : GLU 0E2 225 4.11 4.11 42.99 
GLU C 225 -0.24 -44.56 40.63 : GLU 0 225 -1.27 -1.27 41 14 
PHE N 226 0.53 -43.83 39.81 : PHE CA 226 0.27 0.27 39.57 
PHE CB 226 1.49 -41.64 39.60 : PHE CG 226 2.30 2.30 40.80 
PHE CD1 226 3.13 -42.94 40.85 : PHE CD2 226 2.20 2.20 41.81 
PHE CE1 226 3.90 -43.11 41.96 : PHE CE2 226 2.97 2.97 42.92 
PHE CZ 226 3.83 -42,21 42.99 : PHE C 226 -0.38 -0.38 38.26 
PHE 0 226 -0.82 -40.89 38.08 : GLY N 227 -0.27 -0.27 37.26 
GLY CA 227 -0.95 -42.66 36.02 : GLY C 227 -0.88 -0.88 35.17 
GLY 0 227 -0.09 -44.77 35.49 : ASN N 228 -1.66 -1.66 34.14 
ASN CA 228 -1.38 -45.14 33.23 : ASN CB 228 -1.77 -1.77 33.78 
ASN CG 228 -3.14 -46.74 34.35 : ASN ODl 228 -4.14 -4.14 33.82 
ASN ND2 228 -3.17 -47.51 35.44 : ASN C 228 -2.10 -2.10 31.97 
ASN 0 228 -2.97 -43.95 31.93 : LEU N 229 -1.38 -1.38 30.96 
LEU CA 229 -1.54 -44.92 29.56 : LEU CB 229 -0.13 -0.13 29.09 
LEU CG 229 0.32 -43.99 27.78 : LEU CD1 229 1.82 1.82 27.80 
LEU CD2 229 -0.01 -44.89 26.65 : LEU C 229 -2.10 -2.10 28.78 
LEU 0 229 -1.41 -47.11 28.73 : THR N 230 -3.25 -3.25 28.11 
THR CA 230 -3.60 -47.21 27.31 : THR CB 230 -4.69 -4.69 28.12 
THR OGl 230 -4.96 -49.13 27.32 : THR CG2 230 -5.95 -5.95 28.42 
THR C 230 -3.99 -46.79 25.88 : THR 0 230 -5.11 -5.11 25.53 
GLU N 231 -2.86 -46.84 25.15 : GLU CA 231 -2.62 -2.62 23.70 
GLU CB 231 -1.12 -46.64 23.50 : GLU CG 231 -0.49 -0.49 22.53 
GLU CD 231 0.30 -44.52 23.16 : GLU OEl 231 1.49 1.49 23.33 
GLU 0E2 231 -0.18 -43.40 23.45 : GLU C 231 -3.21 -3.21 22.85 
GLU 0 231 -3.09 -48.90 23.28 : ARG N 232 -3.78 -3.78 21.67 
ARC CA 232 -4.30 -48.65 20.79 : ARC CB 232 -5.79 -5.79 21.15 
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ARC NE 232 -8.92 -47.23 

ARG NH1 232 -9.35 -45.17 

ARC C 232 -4.25 -48.12 

THR N 233 -3.73 -48.78 

THR CB 233 -2.17 -48.08 

. THR CG2 233 -1.14 -48.51 

THR 0 233 -4.88 -49.78 

ASP CA 234 -5.81 -48.29 

ASP CG 234 -7.66 -47.55 

ASP 0D2 234 -7.07 -47.39 

ASP 0 234 -4.30 -47.12 

ASN CA 235 -3.64 -48.91 

ASN CG 235 -1.49 -49.33 

ASN ND2 235 -0.31 -49.97 

ASN 0 235 -4.34 -50.11 

SER CA 236 -6.03 -48.05 

SER OG 236 -8.00> -49.08 

SER 0 236 -7.88 -46.40 

LEU CA 237 -5.86 -44.50 

LEU CG 237 -8.19 -44.14 

LEU CD 2 237 -8.38 -42.70 

LEU 0 237 -3.94 -43.17 

ASN CA 238 -2.50 -45.17 

ASN CG 238 -0.31 -44.40 

ASN ND2 238 0.35 -44.56 

ASN 0 238 -1.76 -43.01 

VAL CA 239 -3.76 -43.13 

VAL CGI 239 -5.83 -41.98 

VAL C 239 -3.58 -43.91 

LEU N 240 -3.19 -43.34 

LEU CB 240 -1.75 -43.57 

LEU CD1 240 0.36 -44.71 

LEU C 240 -4.27 -43.54 

ILE N 241 -5.13 -44.19 

ILE CB 241 -7.53 -44.12 

ILE CGI 241 -8.03 -44,07 

ILE C 241 -5.61 -43.75 

SER N 242 -4.84 -42.83 

SER CB 242 -2.89 -42.33 

SER C 242 -5.18 -42.50 

SER N 243 -4.98 -42.78 

SER CB 243 -6.88 -43.62 

SER C 243 -5.10 -42.48 

ILE N 244 -4.89 -41.37 

ILE CB 244 -2.91 -40.11 

ILE CGI 244 -2.14 -40.57 

ILE C 244 -4.58 -40.84 

GLU N 245 -4.45 -41.43 

GLU CB 245 -6.16 -41.71 

GLU CD 245 -6.95 -44.08 

GLU 0E2 245 -7.98 -43.78 

GLU 0 245 -3.11 -41.63 

MET CA 246 -2.83 -39.38 

MET CG 246 -0.94 -38.81 

MET CE 246 0.79 -36.91 

MET 0 246 -4.41 -38.01 

GLU CA 247 -3.96 -38.40 

GLU CG 247 -4.18 -40.09 
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VAL C 
GLU N 
GLU CB 
GLU CD 
GLU 0E2 
GLU 0 
LEU CA 
LEU CG 
LEU CD2 275 
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VAL CA 276 
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GLY N 
GLY C 
PRO N 
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276 
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-0.32 
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-4 
-6 
-8.35 
-9.66 
-7.44 
-7.68 
-6.24 
-9.18 
-9.63 
-11.28 
-11.63 
-11.79 
-11.17 
-12.21 
-11.48 
-11.73 
-9.30 
-8.82 
-8.19 
-5.97 
-6.99 
-8.33 
-4.56 
-3.61 
-2.18 
-1.76 
0.89 
2.91 
4.00 
4.29 
2.20 
2.79 
1.40 
4.01 
5.13 
7.44 
8.08 
9.30 
7.00 
7.31 
5.93 
6.65 
8.77 
10.96 
13.50 
11-13 
11.26 
12.42 
12.68 



-36.27 

-36.96 

-35.03 
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-33.85 

-34.21 

-33.28 
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22.00 

24.43 
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28.43 
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35.97 
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PRO CA 278 13.80 -25.39 18.52 
PRO CG 278 12.73 -26.38 16.77 
PRO 0 278 12.09 -23.75 18.13 
LYS CA 279 13.66 -21.64 18.99 
LYS CG 279 14.66 -19.49 19.77 
LYS CE 279 16.41 -18.44 21.24 
LYS C 279 13.20 -21.04 17.69 
GLY N 280 11.90 -20.86 17.59 
GLY C 280 11.56 -21.30 15.08 
ASN N 281 12.45 -22.36 14.82 
ASK CB 281 12.81 -21.85 12.40 
ASN OD1 281 14.83 -21.15 11.34 
ASN C 281 14.12 -23.72 13.19 
LYS N 282 14.30 -24.10 11.91 
LYS CB 282 15.23 -25.00 9.67 
LYS CD 282 17.00 -24.72 8.02 
LYS N2 282 19.33 -23.53 7.97 
LYS 0 282 16.64 -22.86 11.65 
GLU CA 283 19.14 -24.13 11.85 
GLU CG 283 21.12 -22.73 10.94 
GLU OE1 283 22.70 -20.97 11.49 
GLU C 283 18.86 -23.83 13.33 
THR N 284 19.65 -23.13 14.13 
THR CB 284 18.09 -22.43 15.86 
THR CG2 284 16.99 -23.14 16.65 
THR 0 284 18.84 -25.55 15.77 
LEU CA 285 20.65 -25.71 17.81 
LEU CG 285 22.68 -27.39 17.30 
LEU CD2 285 24.13 -27.12 17.10 
LEU 0 285 20.42 -25.57 20.19 
GLU CA 286 18.88 -23.32 20.26 
GLU CG 286 19.99-21.12 20.00 
GLU OE1 286 19.72 -19.45 21.61 
GLU C 286 17.49 -23.85 20.58 
TYR N 287 17.27 -24.58 21.68 
TYR CB 287 16.08 -26.36 22.91 
TYR CD1 287 18.18 -27.70 22.64 
TYR CD2 287 16.51 -27.79 20.94 
TYR CZ 287 18.64 -28.84 20.64 
TYR C 287 15.31 -23.94 22.94 
GLU N 288 14.03 -23.98 23.26 
GLU CB 288 13.12 -21.89 22.99 
GLU CD 288 11.21 -20.47 22.42 
GLU OE2 288 11.35 -19.69 21.43 
GLU 0 288 11.59 -24.35 23.70 
SER CA 289 10.34 -24.20 25.99 
SER OG 289 10.80 -24.25 28.45 
SER 0 289 8.98 -22.35 26.51 
TYR CA 290 6.65 -23.72 25.39 
TYR CG 290 7.03 -23.76 22.91 
TYR CE1 290 7.52 -22.22 21.17 
TYR CE2 290 8.96 -24.07 21.55 
TYR OH 290 9.50 -22.46 19.92 
TYR 0 290 5.68 -25.60 26.56 
ARC CA 291 5.03 -24.32 28.76 
ARC CG 291 7.20 -23.95 30.12 
ARG NE 291 9.03 -23.39 31.87 
ARG NH1 291 11.30 -23.90 31.88 
ARG C 291 3.60 -23.85 28.61 
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: THR OG1 284 18.62 18.62 16.54 

: THR C 284 19.47 19.47 16.10 

: LEU N 285 20.51 20.51 16.90 

: LEU CB 285 22.14 22.14 17.86 

: LEU GDI 285 22.18 22.18 15.97 

:'LEUC 285 20.13 20,13 19.10 

: GLU N 286 19.44 19.44 19.07 

: GLU CB 286 18.70 18.70 20.15 

: GLU CD 286 19.72 19.72 20.39 

: GLU OE2 286 19.34 19.34 19.54 

: GLU 0 286 16.60 16.60 19.79 

: TYR CA 287 15.94 15.94 22.10 

: TYR CG 287 16.93 16.93 22.16 

: TYR CE1 287 19.06 19.06 21.90 

: TYR CE2 287 17.36 17.36 20.18 

: TYR OH 287 19.55 19.55 19.74 

: TYR 0 287 15.99 15.99 23.19 

: GLU CA 288 13.35 13.35 23.99 

: GLU CG 288 12.36 12.36 23.42 

: GLU OEl 288 10.21 10.21 22.64 

: GLU C 288 12.09 12.09 24.50 

: SER N 289 11.54 11.54 25.70 

: SER CB 289 10.39 10.39 27.30 

: SER C 289 9.05 9.05 26.02 

: TYR N 290 8.02 8.02 25.56 

: TYR CB 290 6.13 6.13 24.04 

: TYR CD1 290 6.71 6.71 22.20 

: TYR CD2 290 8.15 8.15 22.57 

: TYR CZ 290 8.64 8.64 20.87 

: TYR C 290 5.91 5.91 26.55 

: ARG N 291 5.68 5.68 27.65 

: ARG CB . 291 5.68 5.68 30.03 

: ARG CD 291 7.60 7.60 31.54 

: ARG CZ 291 10.04 10.04 31.53 

: ARG NH2 291 9.85 9.85 30.84 

: ARG 0 291 3.39 3.39 27.95 
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0.67 
0.88 
-1.37 
-1.15 
1.24 
-2.69 
-2.84 
-3.60 
-2.27 
-5.21 
-6.45 
-7.96 
-8.81 
-9.95 
-12.41 
-14.71 
-14.55 
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-13.48 
-13.29 
-10.20 
-7.72 
-6.97 
-7.78 
-7.13 
-5.80 
-7.82 
-4.41 
-3.41 
-1.28 
0.35 
1.62 
2.69 
-1.88. 
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3.16 
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ILE 0 323 0.91 -39.08 13.95 : ASN N 324 

ASN CA 324 1.36 -40.63 11.66 : ASN CB 324 

ASN CG 324 3.61-39.91 10.86 : ASN 0D1 324 

ASN ND2 324 3.45 -40.56 9.72 : ASN C 324 

ASN 0 324 3.16 -41.93 12.70 : ALA N 325 

ALA CA 325 1.65 -43.50 14.07 : ALA CB 325 

ALA C 325 2.38 -44.76 13.70 : ALA 0 325 

ASN N 326 2.55 -45.29 12.47 : ASN CA 326 

ASN CB 326 2.82 -47.23 10.93 : ASN CC 326 

ASN OD1 326 0.87-47.01 9.56 : ASN ND2 326 

ASN C 326 4.61 -46.70 12.58 : ASN 0 326 

ASN N 327 4.98 -47.71 13.34 : ASN CA 327 

ASN CB 327 7.35 -47.99 12.71 : ASN CG 327 

ASN OD1 327 6.82-50.20 11.92 : ASN ND2 327 

ASN C 327 6.72 -46.68 14.64 ; ASN 0 327 

ASN N 328 5.77 -46.08 15.35 : ASN CA 328 

ASN CB 328 4.82-44.35 16.70 : ASN CG 328 

ASN OD1 328 5.90 -42.44 17.53 : ASN ND2 328 

ASN C 328 6.97 -45.47 17.33 : ASN 0 328 

ASN N 329 7.67 -44.78 18.22 : ASN CA 329 

ASN CB 329 9.89 -45.51 18.73 : ASN CG 329 

ASN OD1 329 9.58 -47.73 18.07 : ASN ND2 329 

ASN C 329 8.55 -44.47 20.37 : ASN 0 329 

ARC N 330 8.14 -44,86 21.55 : ARC CA 330 

ARG CB 330 6.89 -44.41 23.51 : ARC CG 330 

ARG CD 330 5.11 -43.97 25.14 : ARC NE 330 

ARG CZ 330 3.99 -46.12 24.50 : ARG NH1 330 

ARG NH2 330 4.91 -46.84 25.15 : ARG C 330 

ARG 0 330 9.70 -44.80 23.86 : ASN N 331 

ASN CA 331 11.41 -42.77 23.94 : ASN CB 331 

ASN *CG 331 12.74 -42.83 21.95 : ASN OD1 331 

ASN ND2 331 12.06 -43.07 20.83 : ASN C 331 

ASN 0 331 11.03 -40.96 25.28 : LEU N 332 

LEU CA 332 11.57 -42.06 27.71 : LEU CB 332 

LEU CG 332 9.51 -42.60 28.75 : LEU CD1 332 

LEU CD2 332 9.00 -43.07 30.05 : LEU C 332 

LEU 0 332 13.92 -42.19 27.84 : LEU N 333 

LEU CA 333 14.30 -39.89 29.27 : LEU CB 333 

LEU CG 333 14.11 -38.13 27.39 : LEU CD1 333 

LEU CD2 333 15.19 -38.68 26.51 : LEU C 333 

LEU 0 333 15.45 -39.45 31.34 : ALA N 334 

ALA CA 334 13.88 -40.85 32.93 : ALA CB 334 

ALA C 334 13.23 -42.15 33.31 : ALA 0 334 

GLY N 335 13.75 -42.88 34.28 : GLY CA 335 

GLY C 335 13.54 -45.31 34.12 : GLY 0 335 

LYS N 336 12.87 -46.42 34.49 : LYS CA 336 

LYS CB 336 13.04 -48.81 35.07 : LYS CG 336 

LYS CD 336 14.36 -49.56 37.16 : LYS CE 336 

LYS N2 336 12.17 -50.26 38.34 : LYS C 336 

LYS 0 336 13.56 -48.46 31.84 : THR N 337 

THR CA 337 11.01 -48.80 31.20 : THR CB 337 

THR OG1 337 9.28 -49.63 32.49 : THR CG2 337 

THR C 337 11.36 -47.82 30.07 : THR 0 337 

ASP N 338 11.73 -48.28 28.88 : ASP CA 338 

ASP CB 338 10.67 -47.02 27.23 : ASP CG 338 

ASP OD1 338 8.62 -48.00 26.63 : ASP OD2 338 

ASP C 338 12.94 -46.16 27.89 : ASP 0 338 

ASN N 339 13.99 -46.17 28.68 : ASN CA 339 

ASN CB 339 15.41 -45.12 30.33 : ASN CG 339 
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22.02 24.09 
24.10 25.81 

24.01 27.15 

24.85 28.29 
24.78 27.19 

23.20 27.33 
23.10 29.73 
20.70 29.57 
20.88 29.57 

24.97 25.29 

25.91 23.60 
27.35 23.85 

27.98 22.82 
29.42 21.68 
29.84 20.99 

30.24 23.02 
30.32 24.09 
31.23 22.75 

28.86 22.25 

30.37 25.02 
29.05 25.01 
28.69 26.78 
29.32 27.84 
30.32 30.02 
29.72 32.26 
29.86 34.19 
30.50 28.92 

32.32 26.84 
34.44 26.56 

35.61 26.18 

31.62 \ 26.47 
29.67 24.57 

27.96 22.80 

28.92 25.56 

28.33 26.60 
27.01 28.63 
25.00 29.54 

28.63 28.28 
29.86 28.49 

31.21 28,34 
31.23 26.98 
32.10 24.83 
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LEU CG 356 33.37 -38.20 24.78 : LEU CD1 356 33.32 33.32 23.64 

LEU CD2 356 34.49 -37.20 24.74 : LEU C 356 30. 27 30.27 25.89 

LEU 0 356 30.27 -35.10 25.58 : THR N 357 29.12 29.12 26.01 

THR CA 357 27.83 -36.31 25.86 : THR CB 357 26.70 26.70 25.68 

THR 0G1 357 26.92 -38.07 24.49 : THR CG2 357 25.35 25,35 25.58 

THR C 357 27,53 -35.47 27.09 : THR 0 357 26.79 26.79 26.87 

PHE H 358 28.02 -35.62 28.33 : PHE CA 358 27.39 27.39 29.47 

PHE CB 358 26.47 -35.97 30.09 : PHE CG 358 25.06 25.06 29.59 

PHE CD1 358 24.36 -34.84 29.44 : PHE CD2 358 24.47 24.47 29.34 

PHE CE1 358 23.04 -34.89 29.05 : PHE CE2 358 23.16 23.16 28.94 

PHE CZ 358 22.44 -36.11 28.80 : PHE C 358 28.24 28,24 30.59 

PHE 0 358 27.73 -34.15 31.70 : SER N 359 29.52 29.52 30.46 

SER CA 359 30.38 -33.45 31.53 : SER CB 359 29.74 29.74 32.60 

SER OG 359 28.66 -31.61 32.20 : SER C 359 30.75 30.75 32.29 

SER 0 359 31.70 -35.39 31,86 : GLY N 360 29.89 29.89 33.24 

GLY CA 360 30.18 -36.17 34,16 : GLY C 360 30.62 30.62 33.47 

GLY 0 360 30.41 -37.67 32.29 : SER H 361 31.29 31,29 34.22 

SER CA 361 31.74 -39.47 33.66 : SER CB 361 32.51 32,51 34.78 

SER OG 361 31.66 -40.19 35.95 : SER C 361 30.59 30.59 33.15 

SER 0 361 29.51 -40.44 33.77 : GLY N 362 30.79 30.79 32.08 

GLY CA 362 29.83 -42.22 31.71 : GLY C 362 29.30 29.30 32.90 

GLY 0 362 28.19 -43.64 32.91 : ASP K 363 30.08 30.08 33.97 

ASP CA 363 29.68 -43.82 35.15 : ASP CB 363 30.94 30.94 35.85 

ASP CG 363 30,95 -45.76 35.56 : ASP 0D1 363 31.71 31.71 34.69 

ASP 0D2 363 30,09 -46.47 36.16 : ASP C 363 28.80 28.80 36.09 

ASP 0 363 27.92 -43.68 36.73 : GLU K 364 29.17 29.17 36.27 

GLU CA 364 28.36 -40.89 37.06 : GLU CB 364 29.01 29.01 37.10 

GLU CG .364 29.84 -39.47 38.34 : GLU CD 364 30.59 30.59 38.50 

GLU OEl 364 30.40 -37.53 39.55 : GLU OE2 364 31.36 31.36 37.61 

GLU C 364 26.95 -40.73 36.45 : GLU 0 364 25.93 25.93 37.14 

VAL N 365 26.86 -40.40 35.14 : VAL CA 365 25.58 25.58 34.51 

VAL CB 365 25.75 -39.58 33.07 ; VAL CGI 365 26.75 26.75 32.21 

VAL CG2 365 24.43 -39.73 32.35 : VAL C 365 24.90 24.90 34.47 

VAL 0 365 23.70 -41.51 34.75 : MET N 366 25.51 25.51 34.27 

MET CA 366 24.71 -43.98 34.29 : MET CB 366 25.50 25.50 34.01 

MET CG 366 25.73 -45.78 32.58 : MET SD 366 24.18 24.18 31.65 

MET CE 366 23.34 -47.27 32.56 : MET C 366 24.05 24.05 35.66 

MET 0 366 22.87 -44.55 35.78 : LYS K 367 24.80 24.80 36.71 

LYS CA 367 24.41 -44.04 38.09 : LYS CB 367 25.51 25.51 39.03 

LYS CG 367 26.21 -44.63 39.88 : LYS CD 367 27.21 27.21 39.04 

LYS CE - 367 28.63 -45,39 39.71 : LYS KZ 367 29,74 29.74 38.73 

LYS C 367 23.15 -43.27 38.42 : LYS 0 367 22.15 22.15 38.91 

LEU K 368 23.23 -41.99 38.09 : LEU CA 368 22.13 22.13 38.34 

LEU CB 368 22.56 -39.70 37.96 : LEU CG 368 21.53 21.53 37.67 

LEU CD1 368 21.08 -37.93 38.94 : LEU CD2 368 22.17 22.17 36.66 

LEU C 368 20.93 -41.47 37.50 : LEU 0 368 19.80 19.80 38.01 

ILE K 369 21.17 -42.00 36.29 : ILE CA 369 20.06 20.06 35.42 

ILE CB 369 20.67 -42.72 34.00 : ILE CC2 369 19.88 19.88 33.20 

ILE CGI 369 20.72 -41.40 33.22 : ILE CD1 369 19.35 19.35 32.98 

ILE C 369 19.38 -43.54 36.14 : ILE 0 369 18.16 18.16 36.31 

ASK K 370 20.22 -44.40 36.69 : ASK CA 370 19.67 19.67 37.50 

ASK CB 370 20,75 -46.52 37.64 : ASK CG 370 20.60 20.60 36.39 

ASK OD1 370 21.46 -47.53 35.50 : ASK KD2 370 19.41 19.41 36.22 

ASK C 370 19.07 -45.03 38.86 : ASK 0 370 18.37 18.37 39.40 

LYS K 371 19.21 -43.82 39.43 : LYS CA 371 18.56 18.56 40.72 

LYS CB 371 18.79 -42.03 41.22 : LYS CG 371 20.26 20.26 41.48 

LYS CD 371 20.31 -40.52 42.58 : LYS CE 371 21.71 21.71 42.92 

LYS KZ 37i 22.07 -38.78 42.07 : LYS C 371 17.04 17.04 40.81 

LYS 0 371 16.53 -43.99 '41.88 : CUT K '372 16.31 16.31 39.70 
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